946 results for VISUAL INFORMATION


Relevance: 60.00%

Abstract:

This paper shows initial results in deploying the biologically inspired Simultaneous Localisation and Mapping system, RatSLAM, in an outdoor environment. RatSLAM has been widely tested in indoor environments on the task of producing topologically coherent maps based on a fusion of odometric and visual information. This paper details the changes required to deploy RatSLAM on a small tractor equipped with odometry and an omnidirectional camera. The principal changes relate to the vision system, with others required for RatSLAM to use omnidirectional visual data. The initial results from mapping around a 500 m loop are promising, with many improvements still to be made.

Relevance: 60.00%

Abstract:

Dyslexia is a neurological condition associated with deficits in the acquisition and processing of language. It varies in severity and manifests as difficulties in receptive and expressive language, including phonological processing, in reading, writing, spelling, handwriting, and sometimes arithmetic. Dyslexia is a hereditary condition associated with several neurological abnormalities in visual and auditory cortical areas. One of the most influential theories proposed to explain dyslexic symptoms is the magnocellular hypothesis. According to this hypothesis, dyslexia results from abnormal processing of visual information, due mainly to dysfunction of the magnocellular system. This dissertation explores the hypothesis by comparing fifteen individuals with dyslexia and fifteen controls, aged 18 to 30, on two visual attention tests. Both experiments measured reaction time to stimuli that appeared anywhere on the computer screen while participants remained in position, with the head supported by a chin rest and the eyes fixed on a central target. Experiment I used white stimuli (small circles) presented on a black background. Experiment II used the same methodology, but with green stimuli (small circles) on a red background. The results were analysed according to the quadrants in which the stimuli were presented. Patients and controls did not differ in reaction time to stimuli presented in the lower visual field relative to the upper quadrant of the same individual. Across all quadrants, dyslexic participants had slower reaction times in Experiment I but reaction times similar to those of controls in Experiment II. These results are compatible with abnormalities in the magnocellular system. The implications of these findings for the pathophysiology of dyslexia, as well as for its treatment, should be discussed further.

Relevance: 60.00%

Abstract:

Following adaptation to an oriented (1-d) signal in central vision, the orientation of subsequently viewed test signals may appear repelled away from or attracted towards the adapting orientation. Small angular differences between the adaptor and test yield 'repulsive' shifts, while large angular differences yield 'attractive' shifts. In peripheral vision, however, both small and large angular differences yield repulsive shifts. To account for these tilt after-effects (TAEs), a cascaded model of orientation estimation that is optimized using hierarchical Bayesian methods is proposed. The model accounts for orientation bias through adaptation-induced losses in information that arise because of signal uncertainties and neural constraints placed upon the propagation of visual information. Repulsive (direct) TAEs arise at early stages of visual processing from adaptation of orientation-selective units with peak sensitivity at the orientation of the adaptor (theta). Attractive (indirect) TAEs result from adaptation of second-stage units with peak sensitivity at theta and theta+90 degrees, which arise from an efficient stage of linear compression that pools across the responses of the first-stage orientation-selective units. A spatial orientation vector is estimated from the transformed oriented unit responses. The change from attractive to repulsive TAEs in peripheral vision can be explained by the differing harmonic biases resulting from constraints on signal power (in central vision) versus signal uncertainties in orientation (in peripheral vision). The proposed model is consistent with recent work by computational neuroscientists in supposing that visual bias reflects the adjustment of a rational system in the light of uncertain signals and system constraints.

Relevance: 60.00%

Abstract:

Image collections are ever growing and hence visual information is becoming more and more important. Moreover, the classical paradigm of taking pictures has changed, first with the spread of digital cameras and, more recently, with mobile devices equipped with integrated cameras. Clearly, these image repositories need to be managed, and tools for effectively and efficiently searching image databases are highly sought after, especially on mobile devices where more and more images are being stored. In this paper, we present an image browsing system for interactive exploration of image collections on mobile devices. Images are arranged so that visually similar images are grouped together while large image repositories become accessible through a hierarchical, browsable tree structure, arranged on a hexagonal lattice. The developed system provides an intuitive and fast interface for navigating through image databases using a variety of touch gestures. © 2012 Springer-Verlag.

Relevance: 60.00%

Abstract:

Drawing on the newest findings of politeness research, this paper proposes an interactionally grounded approach to computer-mediated discourse (CMD). Through the analysis of naturally occurring text-based synchronous interactions of a virtual team, the paper illustrates that the interactional politeness approach can account for linguistic phenomena not yet fully explored in computer-mediated discourse analysis. Strategies used for compensating for the lack of audio-visual information in computer-mediated communication, strategies to compensate for the technological constraints of the medium, and strategies to aid interaction management are examined from an interactional politeness viewpoint and compared to the previous findings of CMD analysis. The conclusion of this preliminary research suggests that the endeavour to communicate along the lines of politeness norms in a work-based virtual environment contradicts some of the previous findings of CMD research (unconventional orthography, capitalization, economizing), and that other areas (such as emoticons, backchannel signals and turn-taking strategies) need to be revisited and re-examined from an interactional perspective to fully understand how language functions in this merely text-based environment.

Relevance: 60.00%

Abstract:

Visual information is becoming increasingly important and tools to manage repositories of media collections are highly sought after. In this paper, we focus on image databases and on how to effectively and efficiently access these. In particular, we present effective image browsing systems that are operated on a large multi-touch environment for truly interactive exploration. Not only do image browsers pose a useful alternative to retrieval-based systems, they also provide a visualisation of the whole image collection and let users explore particular parts of the collection. Our systems are based on the idea that visually similar images are located close to each other in the visualisation, that image thumbnails are arranged on a regular lattice (either a regular grid projected on a sphere or a hexagonal lattice), and that large image datasets can be accessed through a hierarchical tree structure. © 2014 International Information Institute.

Relevance: 60.00%

Abstract:

Image collections are growing at a rapid rate and hence visual information is becoming more and more important. Clearly, these image repositories need to be managed, and tools for effectively and efficiently searching image databases are highly sought after, especially on mobile devices where more and more images are being stored. In this paper, we present an image browsing system for interactive exploration of image collections on mobile devices. Images are arranged so that visually similar images are grouped together while large image repositories become accessible through a hierarchical, browsable tree structure, arranged on a hexagonal lattice. The developed system provides an intuitive and fast interface for navigating through image databases using a variety of touch gestures.

Relevance: 60.00%

Abstract:

Image database visualisations, in particular mapping-based visualisations, provide an interesting approach to accessing image repositories, as they are able to overcome some of the drawbacks associated with retrieval-based approaches. However, making a mapping-based approach work efficiently on large remote image databases has yet to be explored. In this paper, we present Web-Based Images Browser (WBIB), a novel system that efficiently employs image pyramids to reduce bandwidth requirements so that users can interactively explore large remote image databases. © 2013 Authors.
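To make the image-pyramid idea concrete, here is a minimal sketch in plain Python. It is not the WBIB implementation, which the abstract does not describe; the function names (`downsample`, `build_pyramid`, `pick_level`) and the 2x2 averaging scheme are assumptions chosen for illustration. Each level holds a quarter of the pixels of the level above it, so a client that only needs a small thumbnail can request a coarse level and transfer far less data.

```python
def downsample(image):
    """Halve width and height by averaging each 2x2 block of pixels.

    `image` is a list of rows of grey values; a trailing odd row or
    column is dropped.
    """
    height, width = len(image), len(image[0])
    return [
        [
            (image[y][x] + image[y][x + 1]
             + image[y + 1][x] + image[y + 1][x + 1]) / 4
            for x in range(0, width - 1, 2)
        ]
        for y in range(0, height - 1, 2)
    ]


def build_pyramid(image):
    """Return [full resolution, half, quarter, ...] until a side reaches 1."""
    levels = [image]
    while len(levels[-1]) >= 2 and len(levels[-1][0]) >= 2:
        levels.append(downsample(levels[-1]))
    return levels


def pick_level(levels, display_width):
    """Choose the coarsest level that is still at least display_width wide."""
    for level in reversed(levels):
        if len(level[0]) >= display_width:
            return level
    return levels[0]


# Hypothetical 8x8 test image with a simple gradient.
image = [[float(x + y) for x in range(8)] for y in range(8)]
levels = build_pyramid(image)
thumbnail = pick_level(levels, display_width=3)
```

In this example the pyramid has side lengths 8, 4, 2 and 1, and a 3-pixel-wide display slot is served from the 4x4 level, which carries 16 values instead of 64.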

Relevance: 60.00%

Abstract:

A man-machine system called a teleoperator system has been developed to work in hazardous environments such as nuclear reactor plants. Force reflection is a type of force feedback in which forces experienced by the remote manipulator are fed back to the manual controller. In a force-reflecting teleoperation system, the operator uses the manual controller to direct the remote manipulator and receives visual information from a video image and/or graphical animation on the computer screen. This thesis presents the design of a portable Force-Reflecting Manual Controller (FRMC) for the teleoperation of tasks such as hazardous material handling, waste cleanup, and space-related operations. The work consists of the design and construction of a prototype 1-Degree-of-Freedom (DOF) FRMC, the development of the Graphical User Interface (GUI), and system integration. Two control strategies, PID and fuzzy logic control, are developed and experimentally tested. The system response of each is analyzed and evaluated. In addition, the concept of a telesensation system is introduced, and a variety of design alternatives for a 3-DOF FRMC are proposed for future development.
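As an illustration of the first of the two control strategies, the following is a textbook discrete-time PID loop in Python. It is a sketch under stated assumptions, not the thesis's controller: the `PID` class, the toy plant in `simulate` (an inertia with viscous damping, integrated with Euler steps) and all gains are hypothetical values picked only so that the example settles.

```python
class PID:
    """Textbook discrete PID controller with a fixed time step dt."""

    def __init__(self, kp, ki, kd, dt):
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_error = None

    def update(self, setpoint, measurement):
        error = setpoint - measurement
        self.integral += error * self.dt
        # No derivative term on the very first call (no previous error yet).
        if self.prev_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def simulate(controller, setpoint=1.0, steps=2000):
    """Drive a toy 1-DOF joint (assumed plant) and return its final position."""
    position, velocity = 0.0, 0.0
    inertia, damping = 1.0, 0.5  # illustrative plant parameters
    for _ in range(steps):
        torque = controller.update(setpoint, position)
        acceleration = (torque - damping * velocity) / inertia
        velocity += acceleration * controller.dt
        position += velocity * controller.dt
    return position


# Hypothetical gains; 2000 steps of 0.01 s simulate 20 s.
final_position = simulate(PID(kp=20.0, ki=5.0, kd=8.0, dt=0.01))
```

With these assumed gains the simulated joint settles close to the setpoint of 1.0. A fuzzy logic controller would replace the fixed-gain `update` rule with rule-based inference over the error and its rate of change.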

Relevance: 60.00%

Abstract:

Efficient and effective approaches to dealing with the vast amount of visual information available nowadays are highly sought after. This is particularly the case for image collections, both personal and commercial. Due to the magnitude of these ever-expanding image repositories, annotation of all images is infeasible, and search in such an image collection therefore becomes inherently difficult. Although content-based image retrieval techniques have shown much potential, such approaches also suffer from various problems that make them difficult to adopt in practice. In this paper, we follow a different approach, namely that of browsing image databases for image retrieval. In our Honeycomb Image Browser, large image databases are visualised on a hexagonal lattice with image thumbnails occupying hexagons. Arranged in a space-filling manner, visually similar images are located close together, enabling large image datasets to be navigated in a hierarchical manner. Various browsing tools are incorporated to allow for interactive exploration of the database. Experimental results confirm that our approach affords efficient image retrieval. © 2010 IEEE.
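The hexagonal lattice itself can be sketched in a few lines of Python. The abstract does not say how the Honeycomb Image Browser indexes its cells, so the odd-row offset coordinates, the pointy-top orientation and the names `hex_center` and `hex_neighbours` below are assumptions made for illustration. The point of the sketch is the property that motivates a hexagonal layout: every cell has six neighbours, all at the same distance.

```python
import math


def hex_center(row, col, size):
    """Pixel centre of a pointy-top hexagon in odd-row offset coordinates.

    `size` is the distance from the centre of a hexagon to a corner.
    Odd rows are shifted right by half a cell.
    """
    x = size * math.sqrt(3) * (col + 0.5 * (row % 2))
    y = size * 1.5 * row
    return x, y


def hex_neighbours(row, col):
    """The six cells adjacent to (row, col) in odd-row offset coordinates."""
    if row % 2 == 0:
        diagonal = [(-1, -1), (-1, 0), (1, -1), (1, 0)]
    else:
        diagonal = [(-1, 0), (-1, 1), (1, 0), (1, 1)]
    offsets = [(0, -1), (0, 1)] + diagonal
    return [(row + dr, col + dc) for dr, dc in offsets]


def neighbour_distances(row, col, size):
    """Centre-to-centre distances from a cell to each of its neighbours."""
    cx, cy = hex_center(row, col, size)
    return [
        math.hypot(nx - cx, ny - cy)
        for nx, ny in (hex_center(r, c, size) for r, c in hex_neighbours(row, col))
    ]
```

On a square grid the four edge neighbours and four corner neighbours sit at different distances. Here all six distances equal `sqrt(3) * size`, so "close together" means the same thing in every direction when similar thumbnails are placed in adjacent cells.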

Relevance: 60.00%

Abstract:

To perform at the highest level, athletes must possess above-average perceptual-cognitive ability. This faculty, reflected on the field by athletes' vision and game intelligence, makes it possible to extract key information from the visual scene. Sport science has long studied perceptual-cognitive expertise within athletes' own sporting environment. Recently, studies have reported that this expertise can also show up outside that context, for example during everyday activities. In addition, recent theories on the brain's plasticity have led researchers to develop tools for training athletes' perceptual-cognitive abilities in order to make them perform better on the field. These methods are mostly contextual to the targeted discipline. However, a new perceptual-cognitive training tool, called 3-Dimensional Multiple Object Tracking (3D-MOT) and free of any sporting context, has recently emerged and was the subject of our research. One of our objectives was to demonstrate both specific and non-specific perceptual-cognitive expertise in athletes within a single study. We assessed the perception of biological motion in soccer players and non-athletes in a virtual reality room. The athletes were consistently better than the novices, in both efficiency and reaction time, at discriminating the direction of biological motion during a soccer-specific exercise (a kick) and also during an everyday action (walking). These results mean that athletes have a better ability to perceive human biological movements performed by others. Playing soccer therefore seems to confer a fundamental advantage that goes beyond the functions specific to practising a sport.
These findings parallel the exceptional performance of athletes in processing dynamic visual scenes that are likewise free of sporting context. Soccer players outperformed novices on the 3D-MOT test, which consists of tracking moving targets and stimulates perceptual-cognitive abilities. Both their visual tracking speed and their learning capacity were superior. These results confirmed data obtained previously in athletes. 3D-MOT is an attentional tracking test that stimulates the active processing of dynamic visual information, in particular selective, dynamic and sustained attention as well as working memory. This tool can be used to train athletes' perceptual-cognitive functions. Soccer players trained on 3D-MOT for 30 sessions showed a 15% improvement in passing decision-making on the field compared with players in control groups. These data demonstrate for the first time a perceptual-cognitive transfer from the laboratory to the field following perceptual-cognitive training that is not contextual to the targeted athlete's sport. Our research helps to understand athletes' expertise through the specific and non-specific approach and also presents perceptual-cognitive training tools, in particular 3D-MOT, for improving performance in high-level sport.

Relevance: 60.00%

Abstract:

Our visual system ordinarily extracts information at low spatial frequencies (SFs) before information at high SFs. The global information extracted early can thus activate hypotheses about the object's identity and guide the subsequent extraction of finer, more specific information. In autism spectrum disorders (ASD), however, SF perception is atypical. Moreover, the perception of individuals with ASD appears to be less influenced by their priors and previous knowledge. In the study described in the body of this thesis, we aimed to test whether the prior of processing information from low to high SFs was present in individuals with ASD. We compared the time course of SF use in neurotypical participants and participants with ASD by randomly and exhaustively sampling the time x SF space. Neurotypical participants extracted low SFs before higher ones: we were thus able to replicate the result of several earlier studies while characterising it more precisely than ever before. Participants with ASD, in contrast, extracted all useful SFs, low and high, from the outset, indicating that they did not have the prior present in neurotypical individuals. It would thus appear that individuals with ASD extract SFs in a purely bottom-up manner, with extraction not being guided by the activation of hypotheses.

Relevance: 60.00%

Abstract:

The main objective of this thesis was to obtain, through cognitive electrophysiology, indices of functioning after mild traumatic brain injury (mTBI) for different levels of information processing, namely selective attention, visuo-attentional decision processes, and the processes associated with executing a voluntary response. The central hypothesis was that the mechanisms of injury production, together with the pathophysiology characterising mTBI, give rise to visuo-attentional dysfunctions, at least during the acute period following mTBI (i.e. between 1 and 3 months post-accident), as measured with a new electrophysiological paradigm designed for this purpose. This thesis presents two articles describing the work carried out to meet these objectives and thereby test the hypotheses. The first article presents the approach taken to create a new visuospatial attention task yielding electrophysiological (amplitude, latency) and behavioural (reaction time) indices related to early visual and attentional processing (P1, N1, N2-nogo, P2, Ptc), to selective visual attention (N2pc, SPCN) and to decision processes (P3b, P3a) in a group of healthy participants (i.e. without neurological impairment). The second article presents the study of the persistent effects of mTBI on visuo-attentional functions, using the targeted electrophysiological indices (amplitude, latency) and behavioural data (task reaction time and neuropsychological test results) in two cohorts of symptomatic individuals with mTBI, one in the subacute phase (first 3 months post-accident) and the other in the chronic phase (6 months to 1 year post-accident), compared with a group of healthy control participants.
The results of the articles presented in this thesis show that it was possible to create a simple task that allows rapid, low-cost study of the different levels of information processing involved in the deployment of visuospatial attention. Subsequently, using this task with individuals with mTBI tested in the subacute or chronic phase made it possible to objectively document differential profiles of impairment and recovery for each of the components studied. Indeed, while the components associated with early processing of visual information (P1, N1, N2) were intact, certain attentional (P2) and cognitive-attentional (P3a, P3b) components were altered, suggesting a dysfunction in the spatio-temporal dynamics of attention, in the orienting of attention and in working memory, in the short and/or long term after mTBI, in the presence of neuropsychological deficits mainly in the subacute phase and of persistent post-mTBI symptomatology. This thesis underlines the importance of developing sensitive and comprehensive diagnostic tools that can objectively document the various cognitive processes and sub-processes likely to be affected after mTBI.

Relevance: 60.00%

Abstract:

Images acquired from unmanned aerial vehicles (UAVs) can provide data with unprecedented spatial and temporal resolution for three-dimensional (3D) modeling. Solutions developed for this purpose rely mainly on photogrammetry concepts and are known as UAV-Photogrammetry Systems (UAV-PS). Such systems are used in applications where both geospatial and visual information of the environment is required. These applications include, but are not limited to, natural resource management such as precision agriculture, military and police-related services such as traffic-law enforcement, precision engineering such as infrastructure inspection, and health services such as epidemic emergency management. UAV-photogrammetry systems can be differentiated based on their spatial characteristics in terms of accuracy and resolution. That is, some applications, such as precision engineering, require high-resolution and high-accuracy information of the environment (e.g. 3D modeling with less than one centimeter accuracy and resolution). In other applications, lower levels of accuracy might be sufficient (e.g. wildlife management needing a few decimeters of resolution). However, even in those applications, the specific characteristics of UAV-PSs should be well considered in the steps of both system development and application in order to yield satisfying results. In this regard, this thesis presents a comprehensive review of the applications of unmanned aerial imagery, where the objective was to determine the challenges that remote-sensing applications of UAV systems currently face. This review also made it possible to identify the specific characteristics and requirements of UAV-PSs, which are mostly ignored or not thoroughly assessed in recent studies. Accordingly, the focus of the first part of this thesis is on exploring the methodological and experimental aspects of implementing a UAV-PS. 
The developed system was extensively evaluated for precise modeling of an open-pit gravel mine and performing volumetric-change measurements. This application was selected for two main reasons. Firstly, this case study provided a challenging environment for 3D modeling, in terms of scale changes, terrain relief variations as well as structure and texture diversities. Secondly, open-pit-mine monitoring demands high levels of accuracy, which justifies our efforts to improve the developed UAV-PS to its maximum capacities. The hardware of the system consisted of an electric-powered helicopter, a high-resolution digital camera, and an inertial navigation system. The software of the system included the in-house programs specifically designed for camera calibration, platform calibration, system integration, onboard data acquisition, flight planning and ground control point (GCP) detection. The detailed features of the system are discussed in the thesis, and solutions are proposed in order to enhance the system and its photogrammetric outputs. The accuracy of the results was evaluated under various mapping conditions, including direct georeferencing and indirect georeferencing with different numbers, distributions and types of ground control points. Additionally, the effects of imaging configuration and network stability on modeling accuracy were assessed. The second part of this thesis concentrates on improving the techniques of sparse and dense reconstruction. The proposed solutions are alternatives to traditional aerial photogrammetry techniques, properly adapted to specific characteristics of unmanned, low-altitude imagery. Firstly, a method was developed for robust sparse matching and epipolar-geometry estimation. The main achievement of this method was its capacity to handle a very high percentage of outliers (errors among corresponding points) with remarkable computational efficiency (compared to the state-of-the-art techniques). 
Secondly, a block bundle adjustment (BBA) strategy was proposed based on the integration of intrinsic camera calibration parameters as pseudo-observations into the Gauss-Helmert model. The principal advantage of this strategy was controlling the adverse effect of unstable imaging networks and noisy image observations on the accuracy of self-calibration. A sparse implementation of this strategy was also developed, which allowed its application to data sets containing many tie points. Finally, the concepts of intrinsic curves were revisited for dense stereo matching. The proposed technique could achieve a high level of accuracy and efficiency by searching only through a small fraction of the whole disparity search space as well as internally handling occlusions and matching ambiguities. These photogrammetric solutions were extensively tested using synthetic data, close-range images and the images acquired from the gravel-pit mine. Achieving absolute 3D mapping accuracy of 11±7 mm illustrated the success of this system for high-precision modeling of the environment.
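The volumetric-change measurement mentioned for the gravel mine can be illustrated with a simple grid-differencing sketch in Python. The abstract does not state how the thesis computes volumes, so this is an assumed, commonly used approach rather than the author's method: two elevation grids of the same area are subtracted cell by cell and the height differences are multiplied by the cell area. The function name `volume_change` and the sample grids are hypothetical.

```python
def volume_change(dem_before, dem_after, cell_size_m):
    """Estimate cut, fill and net volume between two elevation grids.

    Both grids are lists of rows of elevations in metres, covering the
    same area at the same resolution. Returns (net, cut, fill) in cubic
    metres, where cut is material removed and fill is material added.
    """
    cell_area = cell_size_m ** 2
    cut = 0.0
    fill = 0.0
    for row_before, row_after in zip(dem_before, dem_after):
        for z_before, z_after in zip(row_before, row_after):
            dz = z_after - z_before
            if dz < 0:
                cut += -dz * cell_area
            else:
                fill += dz * cell_area
    return fill - cut, cut, fill


# Hypothetical 2x2 grids with 2 m cells: one cell lowered by 1 m,
# one cell raised by 0.5 m.
before = [[10.0, 10.0], [10.0, 10.0]]
after = [[9.0, 10.0], [10.0, 10.5]]
net, cut, fill = volume_change(before, after, cell_size_m=2.0)
```

For these sample grids the cut is 4 cubic metres, the fill is 2 cubic metres, and the net change is a loss of 2 cubic metres. Because every height error is multiplied by the cell area and summed over the whole pit, the elevation accuracy of the 3D model directly limits the accuracy of the volume estimate.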

Relevance: 60.00%

Abstract:

Discovering ‘photo-excess’: what difference does digital photography bring to the archaeological process, and does this difference constitute a paradigm shift from the traditional film model? Using reflexive practice, the contribution that digital photography has made to the archaeological process is explored. The themes presented in the photographs and exegesis combine visual exploration and original research to examine the role and place of archaeological photography in both a contemporary and an historical context. In contrasting the development of film-based photography of archaeology undertaken in the Eastern Mediterranean during the early 1900s with contemporary digital photography, this exegesis and creative work explores both the synergies and differences of the two photographic methods in archaeology. I introduce the term ‘photo-excess’ to describe the new role that digital photography plays in archaeological practice as compared to film, and demonstrate this difference through my creative work. At the turn of the 20th century, photography was affirmed as the major instrument for visual recording of an archaeological excavation. The combination of archaeological methods and photographic techniques from that era formed an approach to archaeological documentation and recording that was formalised by William Matthews Flinders Petrie in 1904. In this thesis I propose that Petrie became the father of modern archaeological photography through his work, and in recognition of his contribution I refer to his method as the ‘Petrie Paradigm’. Digital photography has made possible a quantum leap in the volume, quality and immediacy of visual data available to the user. Further, through the creative process, digital archaeological photography may provide visual information that exceeds the archaeologist’s original research questions, so that the digital image may sometimes exceed its primary role as a recording device. 
In such cases it may become the starting point for new research due to its potential photo-excess. I propose this as an emerging paradigm for archaeological photography.