997 resultados para Vision par ordinateur
Resumo:
I avhandlingen analyseras arbetsprocesserna vid en webbredaktion. Undersökningen är en etnografisk fallstudie där Hufvudstadsbladets webbproduktion fungerar som case. Den övergripande frågeställningen är hur växelverkan mellan tidningsredaktionen och webbredaktionen fungerar och varför. Syftet var att hitta och synliggöra de underliggande spänningar i organisationen som bimedialiteten kan ha gett upphov till, och analysera produktionen mot bakgrund av tidigare forskning. Analysen behandlar tre områden som ofta återkommer i mediekonvergensforskningen, det vill säga organisation, innehåll och inställning. Forskningsmaterialet är insamlat med hjälp av observation, intervjuer och en e-postenkät. Arbetet på redaktionen observerades under sex arbetsskift. Webbreporterns arbete observerades, observationerna antecknades och efter varje arbetsskift bandades en intervju med webbreportern. Utöver dessa intervjuer gjordes ytterligare tre intervjuer med två nyhetschefer och chefredaktören. En e-postenkät med öppna frågor skickades ut till samtliga redaktionsmedlemmar. Avhandlingen tar avstamp i mediekonvergensforskning, redaktionsforskning och aktivitetsteori. Eftersom den teoretiska utgångspunkten delvis ligger inom aktivitetsteori och utvecklande arbetsforskning räknades samtidigt störningar i arbetsprocessen för att kunna identifiera underliggande spänningar i organisationen. Alla händelser som innebar ett längre eller kortare avbrott i arbetsprocessen antecknades och delades in i kategorier. Sammanlagt sextio störningar identifierades, varav den största andelen, en tredjedel, konstaterades bero på organisations- och kommunikationsfaktorer, främst till följd av bristfällig intern kommunikation. Slutsatserna är webbproduktionen till följd av heterogena objekt i aktivitetssystemet - oklara mål och oklarhet gällande webbens roll i organisationen sitter fast i klyftan mellan ledningens vision och verkligheten på redaktionen. Ett flertal motstridiga uppfattningar om webbproduktionens roll råder på redaktionen. Det leder till störningar i arbetsprocessen som i sin tur gör att produktionen haltar och inte utvecklas. Oklarheten kring målen leder till oklarhet kring konkret praxis, kommunikationssvårigheter, missförstånd och en sned arbetsfördelning, som samtliga inverkar på smidigheten i produktionen.
Resumo:
Modern smart phones often come with a significant amount of computational power and an integrated digital camera making them an ideal platform for intelligents assistants. This work is restricted to retail environments, where users could be provided with for example navigational in- structions to desired products or information about special offers within their close proximity. This kind of applications usually require information about the user's current location in the domain environment, which in our case corresponds to a retail store. We propose a vision based positioning approach that recognizes products the user's mobile phone's camera is currently pointing at. The products are related to locations within the store, which enables us to locate the user by pointing the mobile phone's camera to a group of products. The first step of our method is to extract meaningful features from digital images. We use the Scale- Invariant Feature Transform SIFT algorithm, which extracts features that are highly distinctive in the sense that they can be correctly matched against a large database of features from many images. We collect a comprehensive set of images from all meaningful locations within our domain and extract the SIFT features from each of these images. As the SIFT features are of high dimensionality and thus comparing individual features is infeasible, we apply the Bags of Keypoints method which creates a generic representation, visual category, from all features extracted from images taken from a specific location. A category for an unseen image can be deduced by extracting the corresponding SIFT features and by choosing the category that best fits the extracted features. We have applied the proposed method within a Finnish supermarket. We consider grocery shelves as categories which is a sufficient level of accuracy to help users navigate or to provide useful information about nearby products. We achieve a 40% accuracy which is quite low for commercial applications while significantly outperforming the random guess baseline. Our results suggest that the accuracy of the classification could be increased with a deeper analysis on the domain and by combining existing positioning methods with ours.
Resumo:
This paper is concerned with grasping biological cells in aqueous medium with miniature grippers that can also help estimate forces using vision-based displacement measurement and computation. We present the design, fabrication, and testing of three single-piece, compliant miniature grippers with parallel and angular jaw motions. Two grippers were designed using experience and intuition, while the third one was designed using topology optimization with implicit manufacturing constraints. These grippers were fabricated using different manufacturing techniques using spring steel and polydimethylsiloxane ( PDMS). The grippers also serve the purpose of a force sensor. Toward this, we present a vision-based force-sensing technique by solving Cauchy's problem in elasticity using an improved algorithm. We validated this technique at the macroscale, where there was an independent method to estimate the force. In this study, the gripper was used to hold a yeast ball and a zebrafish egg cell of less than 1 mm in diameter. The forces involved were estimated to be about 30 and 10 mN for the yeast ball and the zebrafish egg cell, respectively.
Resumo:
Relying on Merleau-Ponty's phenomenology of perception and on Mircea Eliade's works on the Sacred and the Profane, this study explores the river as a perceptual space and as the sacred Center in a cosmic vision of the world in twelve of Jean-Marie Gustave Le Clézio's fictional works, from The Interrogation (1963) to Revolutions (2003). In the first chapter, after introducing the field of study, I discuss the relation between the radical subjectivity and the evasiveness of perceiving subjects in Le Clézio's fiction. Next are some thoughts on the relation between Merleau-Ponty's and Le Clézio's ideas. The second chapter studies the river as an experience in the text, first as a topographical space, then as a sound world. The investigations move on to its water as a visual and a tactile phenomenon. Then follows the human use of the river, the (absence of) baths, and the river as a traveling space. The chapter closes with the study of the metaphorical use of the word, occurring mainly in urban space and for phenomena in the sky. The third chapter is organized around the river as the Center of the world in a religious cosmogony, where the river represents the origin of the world and of the human race. The core analysis shows how the middle of the river is a symbolic space of a new beginning. As a sacred space, the river abolishes time as the object of contemplation and as relative immobility from the point of view of a person drifting downstream. The functions of a new beginning and of abolition of time are combined in the symbolic immersions in the water. Finally, the dissertation explores other symbolical spaces, such as the unknown destination of the drift, and the river as the Center of a utopia. The chapter closes with the existential agony as a result of the elimination of the Center in the urban environment. In the final chapter, the river is compared to other watercourses : the creek, the brook and the rapids. The river is more of a spatial entity, whereas the actual water is more important in the smaller watercourses. The river is more common than the other watercourses as a topographical element in the landscape, whereas the minor watercourses invite the characters to a closer contact with their element, in immersions and in drinking their water. Finally, the work situates the rivers in a broader context of different fictional spaces in Le Clézio's text.
Resumo:
An attempt is made to present some challenging problems (mainly to the technically minded researchers) in the development of computational models for certain (visual) processes which are executed with, apparently, deceptive ease by the human visual system. However, in the interest of simplicity (and with a nonmathematical audience in mind), the presentation is almost completely devoid of mathematical formalism. Some of the findings in biological vision are presented in order to provoke some approaches to their computational models, The development of ideas is not complete, and the vast literature on biological and computational vision cannot be reviewed here. A related but rather specific aspect of computational vision (namely, detection of edges) has been discussed by Zucker, who brings out some of the difficulties experienced in the classical approaches.Space limitations here preclude any detailed analysis of even the elementary aspects of information processing in biological vision, However, the main purpose of the present paper is to highlight some of the fascinating problems in the frontier area of modelling mathematically the human vision system.
Resumo:
This paper presents a novel hypothesis on the function of massive feedback pathways in mammalian visual systems. We propose that the cortical feature detectors compete not for the right to represent the output at a point, but for exclusive rights to abstract and represent part of the underlying input. Feedback can do this very naturally. A computational model that implements the above idea for the problem of line detection is presented and based on that we suggest a functional role for the thalamo-cortical loop during perception of lines. We show that the model successfully tackles the so called Cross problem. Based on some recent experimental results, we discuss the biological plausibility of our model. We also comment on the relevance of our hypothesis (on the role of feedback) to general sensory information processing and recognition. (C) 1998 Published by Elsevier Science Ltd. All rights reserved.
Resumo:
This paper presents concepts, designs, and working prototypes of enhanced laparoscopic surgical tools. The enhancements are in equipping the tool with force and temperature sensing as well as image acquisition for stereo vision. Just as the pupils of our eyes are adequately spaced out and the distance between them is adjustable, two minute cameras mounted on a mechanism in our design can be moved closer or farther apart inside the inflated abdomen during the surgery. The cameras are fitted to a deployable mechanism consisting of flexural joints so that they can be inserted through a small incision and then deployed and moved as needed.A temperature sensor and a force sensor are mounted on either of the gripping faces of the surgical grasping tool to measure the temperature and gripping force, which need to be controlled for safe laparoscopic surgery. The sensors are small enough and hence they do not cause interference during surgery and insertion.Prototyping and working of the enhanced laparoscopic tool are presented with details
Resumo:
Digital human modeling (DHM) involves modeling of structure, form and functional capabilities of human users for ergonomics simulation. This paper presents application of geometric procedures for investigating the characteristics of human visual capabilities which are particularly important in the context mentioned above. Using the cone of unrestricted directions through the pupil on a tessellated head model as the geometric interpretation of the clinical field-of-view (FoV), the results obtained are experimentally validated. Estimating the pupil movement for a given gaze direction using Listing's Law, FoVs are re-computed. Significant variation of the FoV is observed with the variation in gaze direction. A novel cube-grid representation, which integrated the unit-cube representation of directions and the enhanced slice representation has been introduced for fast and exact point classification for point visibility analysis for a given FoV. Computation of containment frequency of every grid-cell for a given set of FoVs enabled determination of percentile-based FoV contours for estimating the visual performance of a given population. This is a new concept which makes visibility analysis more meaningful from ergonomics point-of-view. The algorithms are fast enough to support interactive analysis of reasonably complex scenes on a typical desktop computer. (C) 2011 Elsevier Ltd. All rights reserved.
Resumo:
The solvated metal atom dispersion (SMAD) method has been used for the synthesis of colloids of metal nanoparticles. It is a top-down approach involving condensation of metal atoms in low temperature solvent matrices in a SMAD reactor maintained at 77 K. Warming of the matrix results in a slurry of metal atoms that interact with one another to form particles that grow in size. The organic solvent solvates the particles and acts as a weak capping agent to halt/slow down the growth process to a certain extent. This as-prepared colloid consists of metal nanoparticles that are quite polydisperse. In a process termed as digestive ripening, addition of a capping agent to the as-prepared colloid which is polydisperse renders it highly monodisperse either under ambient or thermal conditions. In this, as yet not well-understood process, smaller particles grow and the larger ones diminish in size until the system attains uniformity in size and a dynamic equilibrium is established. Using the SMAD method in combination with digestive ripening process, highly monodisperse metal, core-shell, alloy, and composite nanoparticles have been synthesized. This article is a review of our contributions together with some literature reports on this methodology to realize various nanostructured materials.