979 results for Visual image
Abstract:
The eye is the major organ of vision and highly specialized for photoreception. It focusses light from an object onto the light-sensitive retina. Changes in specialized neurons in the retina result in nerve action potentials which are relayed to the brain via the optic nerve. Visual processing by the brain results in ‘visual perception’, the construction of a sensory image which is consciously appreciated as vision. All other structures of the eye are subsidiary to this function, either by facilitating focusing of light rays or by supporting the tissues of the eye. This chapter is an introduction to the various parts of the eye including the eyelids and associated structures, conjunctiva, cornea, sclera, iris, lens, vitreous body, retina, optic disc and nerve, and orbit. This chapter describes the functions of these various structures and their importance in achieving a visual image.
Abstract:
Thirteen international netballers viewed static images of scenarios taken from netball open play. Two ‘team mates’, each marked by one opponent, could be seen in each image; each team mate-opponent pair was located on opposite sides of the vertical meridian, such that a binary response (‘left’ or ‘right’) was required from the participant to select a team mate to whom they would pass the ball. For each trial, a spoken word (“left”/“right”) was presented monaurally at the onset of the visual image. Spatially invalid auditory cues (i.e., in the ear contralateral to the correct passing option) reduced performance accuracy relative to valid ones. Semantically invalid cues (e.g., a call of “left” when the target was right-located) increased response times relative to valid ones. However, there were no accompanying changes in visual attention to the team mates and their markers. The effects of auditory cues on covert attentional shifts and decision-making are discussed.
Abstract:
Master's dissertation submitted for the degree of Master in Communication Design, presented at the Universidade de Lisboa - Faculdade de Arquitectura.
Abstract:
In this paper we address the problem of positioning a camera attached to the end-effector of a robotic manipulator so that it becomes parallel to a planar object. This problem has long been studied in visual servoing. Our approach attaches several laser pointers to the camera, configured so as to produce a suitable set of visual features. Structured light is used not only to ease the image processing and to allow low-textured objects to be handled, but also to produce a control scheme with desirable properties such as decoupling, stability, good conditioning and a good camera trajectory.
Abstract:
Robotic platforms have advanced greatly in terms of their remote sensing capabilities, including obtaining optical information using cameras. Alongside these advances, visual mapping has become a very active research area, which facilitates the mapping of areas inaccessible to humans. This requires the efficient processing of data to increase the final mosaic quality and computational efficiency. In this paper, we propose an efficient image mosaicing algorithm for large area visual mapping in underwater environments using multiple underwater robots. Our method identifies overlapping image pairs in the trajectories carried out by the different robots during the topology estimation process, which is a cornerstone for efficiently mapping large areas of the seafloor. We present comparative results based on challenging real underwater datasets, which simulated multi-robot mapping.
Abstract:
The first part of this thesis presents a genealogy of the image. It describes how the concepts of the image, of seeing and of the self developed in relation to one another in a specific scientific and philosophical context. The narrative extends from the early Renaissance and perspectival painting to the birth of photography and positivism. This development brought with it a form of reductionism in which the role of the self (the significance of human psychology, our judgement, our attention and our will) was overlooked. Within this tradition of thought a shift occurred, from an understanding of the image as a representation of three-dimensional space on a two-dimensional surface to a conception of the image as a transparent pane, a window onto the world. The idea of depiction as a neutral "view from nowhere" came to reinforce a sceptical attitude towards communication, dialogue and testimony, and thereby also to undermine our trust in one another and, consequently, our trust in ourselves. In the second part, the author offers an alternative to the tradition of thought discussed in the first part. What was overlooked in the notion of a view from nowhere is that the image is an aid when we work through our visual field. The image helps us share our view of things. Through this task of sharing, the image gives direction to our attempts to orient ourselves in the world. I can stand beside another person and see what she sees, but I do not necessarily know how she perceives what we see. The image adds a link to this relationship, because it does not merely show what the other sees. When the image works as it should, it also shows how the other sees, and in this way the image becomes effective. The present thesis combines epistemology with the history of science and visual culture studies, but its main interest is philosophical. It deals with philosophical misunderstandings regarding depiction as a mimetic art form, knowledge as domestication, and perception as the reception of data.
------------------------------------------------------
The first part of this dissertation examines the genealogy of the concept of the image. It illustrates how the concepts of the image, of seeing and of the self developed in relation to one another. The narrative extends from the early Renaissance and perspectival painting to the age of positivism and the birth of photography. This development brought with it a reductionism in which the role of the self (the significance of human psychology, our capacity for judgement, our capacity for attention and our will) sank into oblivion. A shift took place in the world of thought: the meaning of the image changed from a conception in which it is a representation of three-dimensional space on a two-dimensional surface to a conception in which the image is a transparent pane, a window onto the world. The idea of the image as the bearer of a neutral point of view strengthened a sceptical attitude towards communication, dialogue and subjectivity. This scepticism also manifested itself as a strong distrust of human-centredness and otherness. In the second part, the author offers an alternative to the sceptical way of thinking examined in the first part. The image is also a tool that helps us structure our visual field. It helps us share our conceptions with one another. This practice of sharing what we see is the central task of the image. I can stand next to another person and see the same things as they do, but I do not necessarily understand how they see these things. The image adds something essential to this relationship. When the image works as it is supposed to, it also shows how the other sees, and in this way the image becomes a mediator. This dissertation combines epistemology, the history of science and the study of visual culture, but its principal aim is philosophical. It addresses philosophical misunderstandings concerning the eidetic character of the image.
Abstract:
This thesis is the outcome of investigations into the development of an Artificial Neural Network (ANN) model to implement the 2-D DFT at high speed. A new definition of the 2-D DFT relation is presented. This new definition organizes the DFT computation in stages that involve only real additions, except at the final stage of computation. The number of stages is always fixed at 4. Two different strategies are proposed: 1) a visual representation of 2-D DFT coefficients, and 2) a neural network approach. The visual representation scheme can be used to compute, analyze and manipulate 2-D signals such as images in the frequency domain in terms of symbols derived from the 2x2 DFT. This, in turn, can be represented in terms of real data. This approach can help analyze signals in the frequency domain even without computing the DFT coefficients. A hierarchical neural network model is developed to implement the 2-D DFT. Presently, this model is capable of implementing the 2-D DFT for a particular order N such that ((N))_4 = 2 (that is, N modulo 4 equals 2). The model can be developed into one that can implement the 2-D DFT for any order N up to a set maximum limited by the hardware constraints. The reported method shows potential for implementing the 2-D DFT in hardware as a VLSI / ASIC.
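As a point of reference for the abstract above, the following minimal Python sketch computes the conventional 2-D DFT from its textbook definition. It also spells out the 2x2 case, where every twiddle factor is +1 or -1, so the transform reduces to additions and subtractions. This is not the thesis's new staged definition or its neural network model, neither of which is given in the abstract. The function names `dft2_direct` and `dft2_2x2` are hypothetical, chosen only for illustration.

```python
import numpy as np


def dft2_direct(x):
    """Conventional 2-D DFT via separable DFT matrices.

    Illustrative only: this is the textbook definition, not the staged,
    addition-only formulation described in the thesis abstract.
    """
    x = np.asarray(x, dtype=complex)
    rows, cols = x.shape
    # DFT matrix entries: W[k, n] = exp(-2j * pi * k * n / N)
    w_rows = np.exp(-2j * np.pi * np.outer(np.arange(rows), np.arange(rows)) / rows)
    w_cols = np.exp(-2j * np.pi * np.outer(np.arange(cols), np.arange(cols)) / cols)
    # Transform along rows, then along columns.
    return w_rows @ x @ w_cols


def dft2_2x2(block):
    """2x2 DFT: all twiddle factors are +1 or -1, so only additions
    and subtractions of the four input samples are required."""
    (a, b), (c, d) = block
    return np.array([[a + b + c + d, a - b + c - d],
                     [a + b - c - d, a - b - c + d]])


# Hypothetical test image of order N = 6 (6 modulo 4 equals 2).
image = np.random.default_rng(0).normal(size=(6, 6))
spectrum = dft2_direct(image)
```

For real input, `dft2_2x2` returns purely real coefficients, which is consistent with the abstract's remark that symbols derived from the 2x2 DFT can be represented in terms of real data.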
Abstract:
We present a statistical image-based shape + structure model for Bayesian visual hull reconstruction and 3D structure inference. The 3D shape of a class of objects is represented by sets of contours from silhouette views simultaneously observed from multiple calibrated cameras. Bayesian reconstructions of new shapes are then estimated using a prior density constructed with a mixture model and probabilistic principal components analysis. We show how the use of a class-specific prior in a visual hull reconstruction can reduce the effect of segmentation errors from the silhouette extraction process. The proposed method is applied to a data set of pedestrian images, and improvements in the approximate 3D models under various noise conditions are shown. We further augment the shape model to incorporate structural features of interest; unknown structural parameters for a novel set of contours are then inferred via the Bayesian reconstruction process. Model matching and parameter inference are done entirely in the image domain and require no explicit 3D construction. Our shape model enables accurate estimation of structure despite segmentation errors or missing views in the input silhouettes, and works even with only a single input view. Using a data set of thousands of pedestrian images generated from a synthetic model, we can accurately infer the 3D locations of 19 joints on the body based on observed silhouette contours from real images.
Abstract:
Scene classification based on latent Dirichlet allocation (LDA) is a more general modeling method known as a bag of visual words, in which the construction of a visual vocabulary is a crucial quantization process to ensure success of the classification. A framework is developed using the following new aspects: Gaussian mixture clustering for the quantization process, the use of an integrated visual vocabulary (IVV), which is built as the union of all centroids obtained from the separate quantization process of each class, and the usage of some features, including edge orientation histogram, CIELab color moments, and gray-level co-occurrence matrix (GLCM). The experiments are conducted on IKONOS images with six semantic classes (tree, grassland, residential, commercial/industrial, road, and water). The results show that the use of an IVV increases the overall accuracy (OA) by 11 to 12% and 6% when it is implemented on the selected and all features, respectively. The selected features of CIELab color moments and GLCM provide a better OA than the implementation over CIELab color moment or GLCM as individuals. The latter increases the OA by only ∼2 to 3%. Moreover, the results show that the OA of LDA outperforms the OA of C4.5 and naive Bayes tree by ∼20%. © 2014 Society of Photo-Optical Instrumentation Engineers (SPIE) [DOI: 10.1117/1.JRS.8.083690]
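The integrated visual vocabulary described above (quantize each class separately, then take the union of all centroids) can be sketched roughly as follows. Two assumptions are worth flagging. First, the paper uses Gaussian mixture clustering, whereas this sketch swaps in a plain k-means loop to stay dependency-free. Second, the descriptors are synthetic random vectors rather than edge, CIELab or GLCM features. All function names are hypothetical, and the LDA classification step is not shown.

```python
import numpy as np


def kmeans_centroids(descriptors, k, iters=20, seed=0):
    """Plain k-means, used here as a stand-in for the paper's
    Gaussian mixture clustering (an assumption of this sketch)."""
    rng = np.random.default_rng(seed)
    start = rng.choice(len(descriptors), size=k, replace=False)
    centroids = descriptors[start].astype(float)
    for _ in range(iters):
        dists = np.linalg.norm(descriptors[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = descriptors[labels == j]
            if len(members):  # leave an empty cluster's centroid unchanged
                centroids[j] = members.mean(axis=0)
    return centroids


def integrated_vocabulary(descriptors_by_class, k_per_class):
    """Union of the centroids obtained by quantizing each class separately."""
    return np.vstack([kmeans_centroids(d, k_per_class)
                      for d in descriptors_by_class.values()])


def bovw_histogram(descriptors, vocabulary):
    """Normalized bag-of-visual-words histogram: nearest-word counts."""
    dists = np.linalg.norm(descriptors[:, None, :] - vocabulary[None, :, :], axis=2)
    words = dists.argmin(axis=1)
    hist = np.bincount(words, minlength=len(vocabulary)).astype(float)
    return hist / hist.sum()


# Synthetic descriptors for two hypothetical classes (4-D features).
rng = np.random.default_rng(1)
training = {"tree": rng.normal(0.0, 1.0, size=(50, 4)),
            "water": rng.normal(5.0, 1.0, size=(50, 4))}
vocabulary = integrated_vocabulary(training, k_per_class=3)
hist = bovw_histogram(rng.normal(5.0, 1.0, size=(30, 4)), vocabulary)
```

In a full pipeline, histograms like `hist` would be the word counts passed to the topic model.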
Abstract:
Multidimensional visualization techniques are invaluable tools for the analysis of structured and unstructured data with variable dimensionality. This paper introduces PEx-Image (Projection Explorer for Images), a tool aimed at supporting the analysis of image collections. The tool supports a methodology that employs interactive visualizations to aid user-driven feature detection and classification tasks, thus offering improved analysis and exploration capabilities. The visual mappings employ similarity-based multidimensional projections and point placement to lay out the data on a plane for visual exploration. In addition to its application to image databases, we also illustrate how the proposed approach can be successfully employed in the simultaneous analysis of different data types, such as text and images, offering a common visual representation for data expressed in different modalities.
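To make the idea of a similarity-based projection onto a plane concrete, here is a minimal sketch using classical multidimensional scaling (MDS). The abstract does not say which projection techniques PEx-Image implements, so classical MDS is an assumed stand-in, and the function name `classical_mds` and the synthetic feature vectors are hypothetical.

```python
import numpy as np


def classical_mds(features, dims=2):
    """Project feature vectors to `dims` dimensions so that pairwise
    Euclidean distances are preserved as well as possible.

    Classical MDS is a stand-in here, not the tool's documented method.
    """
    x = np.asarray(features, dtype=float)
    n = len(x)
    # Squared pairwise Euclidean distances between all items.
    sq_dists = np.sum((x[:, None, :] - x[None, :, :]) ** 2, axis=2)
    # Double centering turns squared distances into a Gram matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ sq_dists @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)
    # Keep the directions with the largest eigenvalues.
    order = np.argsort(eigvals)[::-1][:dims]
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale


# Synthetic image features: 10 items lying on a plane inside a 5-D space.
rng = np.random.default_rng(2)
features = np.hstack([rng.normal(size=(10, 2)), np.zeros((10, 3))])
layout = classical_mds(features)
```

Each row of `layout` is a 2-D position at which an image thumbnail could be placed, so that similar items land near one another.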
Abstract:
Studies of DNA damage in gastric epithelial cells of Helicobacter pylori (H. pylori)-infected patients are conflicting, possibly due to the different methods used for scoring DNA damage by Comet assay. Therefore, we compared the sensitivity of visual microscopic analysis (arbitrary units-scores and comets%) and an image analysis system (tail moment) in gastric epithelial cells from the antrum and corpus of 122 H. pylori-infected and 32 non-infected patients. The feasibility of using cryopreserved peripheral blood lymphocytes and whole-blood cells for DNA damage biomonitoring was also investigated. In the antrum, the levels of DNA damage were significantly higher in H. pylori-infected patients with gastritis than in non-infected patients with normal mucosa when evaluated by the image analysis system, arbitrary units and comets%. In the corpus, comets% was not sufficiently sensitive to detect the difference between H. pylori-infected patients with gastritis and non-infected patients with normal mucosa. The image analysis system was sensitive enough to detect differences between non-infected patients and H. pylori-infected patients with mild gastritis, and between infected patients with moderate and severe gastritis, in both antrum and corpus, while arbitrary units and comets% were unable to detect these differences. In cryopreserved peripheral blood lymphocytes, the levels of DNA damage (tail moment) were significantly higher in H. pylori-infected patients with moderate and severe gastritis than in non-infected patients. Overall, our results indicate that the image analysis system is more sensitive and better suited to measuring the levels of DNA damage in gastric epithelial cells than the other methods assayed. © 2005 Elsevier B.V. All rights reserved.
Abstract:
In contrast to the first attempts to solve the image categorization problem, which were often based on global features, several researchers have recently been tackling this problem from a new vantage point: using features around locally invariant interest points and visual dictionaries. Although several advances have been made in the visual dictionaries literature in the past few years, a problem we still need to cope with is determining the number of representative words in the dictionary. Therefore, in this paper we introduce a new solution for automatically finding the number of visual words in an N-Way image categorization problem by means of supervised pattern classification based on optimum-path forest. © 2011 IEEE.