14 results for Object length perception
at Universitat de Girona, Spain
Abstract:
We propose a probabilistic object classifier for outdoor scene analysis as a first step towards solving the problem of scene context generation. The method begins with a top-down control stage, which uses previously learned models (appearance and absolute location) to obtain an initial pixel-level classification. This classification gives us the core of each object, which is used to acquire a more accurate object model. Growing these cores by means of specific active regions then yields an accurate recognition of known regions. Next, a general segmentation stage segments the unknown regions using a bottom-up strategy. Finally, the last stage attempts a region fusion of known and unknown segmented objects. The result is both a segmentation of the image and a recognition of each segment as a given object class or as an unknown segmented object. Experimental results are shown and evaluated to demonstrate the validity of our proposal.
Abstract:
Given a set of images of scenes containing different object categories (e.g. grass, roads), our objective is to discover these objects in each image and to use these object occurrences to perform scene classification (e.g. beach scene, mountain scene). We achieve this with a supervised learning algorithm that can learn from few images, which eases the user's task. We use a probabilistic model to recognise the objects, and we then classify the scene based on the object occurrences. Experimental results are shown and evaluated to demonstrate the validity of our proposal. Object recognition performance is compared to the approaches of He et al. (2004) and Marti et al. (2001) using their own datasets. Furthermore, an unsupervised method is implemented in order to evaluate the advantages and disadvantages of our supervised classification approach versus an unsupervised one.
Abstract:
A new method for the automated selection of colour features is described. The algorithm consists of two processing stages. In the first, a complete set of colour features is calculated for every object of interest in an image. In the second, each object is mapped into several n-dimensional feature spaces in order to select the feature set with the fewest variables that is able to discriminate it from the remaining objects. The discrimination power of each candidate subset of features is evaluated by means of decision trees composed of linear discrimination functions. This method can provide valuable help in outdoor scene analysis, where no colour space has been shown to be the most suitable. Experimental results on recognising objects in outdoor scenes are reported.
Abstract:
Path planning and control strategies applied to autonomous mobile robots should fulfil safety rules as well as achieve final goals. Trajectory planning applications should be fast and flexible to allow real-time implementations as well as interaction with the environment. The methodology presented uses on-robot information as the meaningful data needed to plan a path through a narrow passage, using a corridor based on attraction potential fields that brings the mobile robot towards the final desired configuration. It employs local and dense occupancy grid perception to avoid collisions. The key goals of this research project are computational simplicity and the possibility of integrating this method with other methods reported by the research community. Another important aspect of this work consists in testing the proposed method on a mobile robot with a perception system composed of a monocular camera and odometers placed on the two wheels of the differential-drive motion system. Hence, visual data are used as a local horizon of perception within which collision-free trajectories are computed that satisfy the final goal approach and safety criteria.
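The idea of an attraction potential field combined with an occupancy grid check can be illustrated with a minimal sketch. This is not the method of the paper: the quadratic potential U(q) = 0.5 * gain * ||q - goal||^2, the step clamping, and every name below (`attraction_step`, `is_free`, `plan`, `gain`, `max_step`) are assumptions made for illustration only.

```python
import math

def attraction_step(pos, goal, gain=1.0, max_step=0.5):
    """Take one gradient-descent step on U(q) = 0.5 * gain * ||q - goal||^2.

    The attractive force is -grad U = gain * (goal - q); the step length is
    clamped to max_step so the robot never jumps across several grid cells.
    """
    fx = gain * (goal[0] - pos[0])
    fy = gain * (goal[1] - pos[1])
    norm = math.hypot(fx, fy)
    if norm > max_step:
        fx, fy = fx * max_step / norm, fy * max_step / norm
    return (pos[0] + fx, pos[1] + fy)

def is_free(grid, pos):
    """Return True if pos (x, y) lies inside the grid in an unoccupied cell (0)."""
    row, col = math.floor(pos[1]), math.floor(pos[0])
    inside = 0 <= row < len(grid) and 0 <= col < len(grid[0])
    return inside and grid[row][col] == 0

def plan(grid, start, goal, tol=0.1, max_iters=200):
    """Follow the attractive field from start until the goal or an obstacle is reached."""
    path = [start]
    pos = start
    for _ in range(max_iters):
        if math.dist(pos, goal) <= tol:
            break
        nxt = attraction_step(pos, goal)
        if not is_free(grid, nxt):
            break  # hypothetical policy: stop rather than enter an occupied cell
        pos = nxt
        path.append(pos)
    return path

# Usage: a 5x5 empty grid (0 = free, 1 = occupied), coordinates in cell units.
empty = [[0] * 5 for _ in range(5)]
path = plan(empty, (0.5, 0.5), (4.5, 4.5))
```

A purely attractive field stops at the first occupied cell on the straight line to the goal, which is why practical planners combine it with further constraints such as the corridor mentioned in the abstract.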
Abstract:
This paper focuses on the problem of realizing a plane-to-plane virtual link between a camera attached to the end-effector of a robot and a planar object. To make the system independent of the object's surface appearance, a structured light emitter is linked to the camera so that four laser pointers are projected onto the object. In a previous paper we showed that such a system performs well and has desirable characteristics such as partial decoupling near the desired state and robustness against misalignment of the emitter and the camera (J. Pages et al., 2004). However, no analytical results concerning the global asymptotic stability of the system were obtained, owing to the high complexity of the visual features used. In this work we present a better set of visual features, which improves on the properties of the features in (J. Pages et al., 2004) and for which it is possible to prove global asymptotic stability.
Abstract:
In this paper we address the problem of positioning a camera attached to the end-effector of a robotic manipulator so that it becomes parallel to a planar object. This problem has long been studied in visual servoing. Our approach is based on linking several laser pointers to the camera, in a configuration designed to produce a suitable set of visual features. The aim of using structured light is not only to ease the image processing and to allow low-textured objects to be handled, but also to produce a control scheme with desirable properties such as decoupling, stability, good conditioning and a good camera trajectory.
Abstract:
Coded structured light is an optical technique based on active stereovision that obtains the shape of objects. One-shot techniques are based on projecting a unique light pattern with an LCD projector so that, by grabbing a single image with a camera, a large number of correspondences can be obtained. A 3D reconstruction of the illuminated object can then be recovered by means of triangulation. The most widely used strategy for encoding one-shot patterns is based on De Bruijn sequences. In this work a new way to design patterns using this type of sequence is presented. The new coding strategy minimises the number of required colours and maximises both the resolution and the accuracy.
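To see why De Bruijn sequences suit one-shot patterns, the following sketch generates a sequence B(k, n) with the standard recursive (Lyndon word) construction and checks that every window of n consecutive symbols is unique, so a locally observed window identifies its position in the pattern. It does not reproduce the new coding strategy of the paper; the function names and the colour list are hypothetical.

```python
def de_bruijn(k: int, n: int) -> list[int]:
    """Return a De Bruijn sequence B(k, n) over the alphabet 0..k-1.

    The cyclic sequence has length k**n and contains every word of
    length n exactly once.
    """
    a = [0] * (k * n)
    sequence: list[int] = []

    def db(t: int, p: int) -> None:
        if t > n:
            if n % p == 0:
                sequence.extend(a[1:p + 1])
        else:
            a[t] = a[t - p]
            db(t + 1, p)
            for j in range(a[t - p] + 1, k):
                a[t] = j
                db(t + 1, t)

    db(1, 1)
    return sequence

def windows_are_unique(seq: list[int], n: int) -> bool:
    """Check that all cyclic windows of length n in seq are distinct."""
    cyclic = seq + seq[:n - 1]  # wrap around so that cyclic windows are included
    windows = [tuple(cyclic[i:i + n]) for i in range(len(seq))]
    return len(set(windows)) == len(windows)

# Usage: 3 hypothetical stripe colours, window size 3, giving 27 stripes.
COLOURS = ["red", "green", "blue"]  # assumed palette, for illustration only
stripes = [COLOURS[s] for s in de_bruijn(len(COLOURS), 3)]
```

With k colours and window size n, the pattern holds k**n stripes, which makes the trade-off between the number of colours and the resolution explicit.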
Abstract:
Shape complexity has recently received attention from different fields, such as computer vision and psychology. In this paper, integral geometry and information theory tools are applied to quantify shape complexity from two different perspectives: from the inside of the object, we evaluate its degree of structure or correlation between its surfaces (inner complexity), and from the outside, we compute its degree of interaction with the circumscribing sphere (outer complexity). Our shape complexity measures are based on the following two facts: uniformly distributed global lines crossing an object define a continuous information channel, and the continuous mutual information of this channel is independent of the object discretisation and invariant to translations, rotations, and changes of scale. The measures introduced in this paper can potentially be used as shape descriptors for object recognition, image retrieval, object localisation, tumour analysis, and protein docking, among others.
Abstract:
This study uses a cross-sectional design in the field of ethnolinguistic vitality perception. It is the first to compare the perception of ethnolinguistic vitality, and its associated factors, between young adults and adults, in relation to the Spanish-speaking and Catalan-speaking groups of the Autonomous Community of Catalonia. To this end, the 'Cuestionario de vitalidad etnolingüística subjetiva' (CVS; Subjective Ethnolinguistic Vitality Questionnaire) was administered to a sample of 527 participants, 268 young people and 259 adults, from which a subsample was selected of individuals who had Catalan as their mother tongue and identified themselves as Catalan (n=301). Both samples show a tendency to discriminate favourably in the vitality perceived by the Catalan group, a tendency that is significantly stronger among the young people studied than among the adult age group. The results are discussed in terms of the theoretical and practical implications of studies carried out in the field of intergroup communication.
Abstract:
The research we present here forms part of a two-phase project, one phase quantitative and the other qualitative, assessing the use of primary health care services. This paper presents the qualitative phase of that research, which is aimed at ascertaining the needs, beliefs, barriers to access and health practices of the immigrant population in comparison with the native population, as well as the perceptions of healthcare professionals. The qualitative phase specifically addressed Moroccan and sub-Saharan immigrants. The aims of this paper are as follows: to analyse any possible implications of family organisation for the health practices of the immigrant population; to ascertain social practices relating to illness; to understand the meanings of sexual and reproductive health practices; and to ascertain the ideas and perceptions of immigrants, local people and professionals regarding health and the health system. Methods: qualitative research based on discourse analysis. Data gathering techniques consisted of discussion groups with health system users and semi-structured individual interviews with healthcare professionals. The sample was taken from the Basic Healthcare Areas of Salt and Banyoles (belonging to the Girona Healthcare Region), the discussion groups comprising (a) 6 immigrant Moroccan women, (b) 7 immigrant sub-Saharan African women and (c) 6 immigrant and native men (2 native men, 2 Moroccan men and 2 sub-Saharan men); the semi-structured interviews were conducted with the following healthcare professionals: (a) 3 gynaecologists, (b) 3 nurses and 1 administrative staff member. Results: use of the healthcare system is linked to the perception of not being well, knowledge of the healthcare system, length of time resident in Spain and internalisation of traditional Western medicine as a cure mechanism.
The divergences found among the groups of immigrants, local people and healthcare professionals with regard to healthcare education, use of the healthcare service, sexual and reproductive healthcare and reticence about being attended by healthcare personnel of the opposite sex demonstrate a need to work with the immigrant population as a heterogeneous group. Conclusions: the results we have obtained support the idea that feeling unwell is a psycho-social process, as it takes place within a specific socio-cultural situation and spans a range of beliefs, perceptions and ideas regarding symptomology and how to treat it.
Abstract:
Psychological well-being, understood as the psychological facet of the broader concept of quality of life, is an expanding field of study. Although it has a shorter history than other psychosocial constructs, researchers from an ever wider range of disciplines are joining the list of scholars who make psychological well-being one of their objects of research. Nevertheless, the study of psychological well-being in adolescence is probably one of the areas in which the need for further progress is most evident. Its study in adolescent subjects is, moreover, of twofold interest. On the one hand, the changes and transitions that boys and girls experience during adolescence often make it a stressful period for many of them, with important implications for their psychological well-being. Deepening our knowledge of it during this period is of interest beyond the strictly scientific and allows the design of prevention programmes better suited to the problems that adolescents may be experiencing. Exploring the elements of psychological well-being is one of the strategies for approaching its study. In this doctoral thesis we have selected some of the elements that, according to the scientific literature, are most closely connected with psychological well-being: satisfaction with life as a whole and with specific domains of life, self-esteem, perceived social support, perceived control, and values. Although there is broad consensus that exploring these elements is essential for a deeper understanding of the structure of psychological well-being, they have generally been studied separately, even though attempts at theoretical integration are not lacking.
The most important limitations currently affecting the study of psychological well-being and of its elements are basically epistemological and concern the difficulty of finding common views (in terms of both definitions and explanatory theories) shared by a majority of social researchers. These limitations justify the interest in turning attention towards another type of explanation of psychological well-being, qualitatively different from those available, that takes refuge neither in reductionism nor in rigid causal explanations. Complexity theories offer a productive alternative in this respect, since the characteristics through which complexity arises (fuzzy boundaries, catastrophe points, fractal dimensions, chaotic and non-linear processes) are, ultimately, the same properties that characterise psychosocial phenomena. And that includes psychological well-being. The data available to us, obtained through a cross-sectional study, prevent us from approaching psychological well-being from all of the properties of complexity mentioned, with the exception of non-linearity. The general objective of the thesis was to build a model of psychological well-being from the data obtained that would make it possible to: 1) bring to light relationships between variables that have so far been little explored, 2) consider these relationships beyond their unidirectionality, and 3) understand psychological well-being in adolescence from a more integrative and holistic point of view and, consequently, offer a more comprehensive way of approaching this phenomenon. This thesis should be understood as a first step, fundamentally methodological, towards the future development of conceptualisations of psychological well-being in adolescence based on the principles provided by the sciences of complexity.
Although the results obtained are not free of limitations, they open up new perspectives for analysing psychological well-being in adolescence.
Abstract:
This thesis deals with the combination of visual servoing and structured light. Classical visual servoing assumes that visual features can be easily extracted from the images. As a result, objects that are uniform in appearance or poorly textured cannot be handled. In this thesis we propose using structured light to provide objects with visual features regardless of their appearance. First, a broad study of structured light is presented, which allows us to propose a new coded pattern that improves on existing ones. The rest of the thesis concentrates on positioning a robot equipped with a camera with respect to different objects, using the information provided by the projection of different light patterns. Two configurations have been studied: one in which the light projector is separate from the robot, and one in which the projector is mounted on the robot together with the camera. The techniques proposed in the thesis are supported by an extensive analytical study and validated by experimental results.
Abstract:
The growth of databases that contain increasingly difficult images and a larger number of categories is driving the development of image representations that are discriminative when working with multiple classes, and of algorithms that are efficient in learning and classification. This thesis explores the problem of classifying images according to the object they contain when a large number of categories is available. First, we investigate how a hybrid system formed by a generative model and a discriminative model can benefit the image classification task when the level of human annotation is minimal. For this task we introduce a new vocabulary using a dense representation of colour-SIFT descriptors, and we then investigate how the different parameters affect the final classification. Next, a method is proposed for incorporating spatial information into the hybrid system, showing that context information is of great help for image classification. We then introduce a new shape descriptor that represents the image by its local shape and its spatial shape, together with a kernel that incorporates this spatial information in a pyramidal form. Shape is represented by a compact vector, yielding a descriptor well suited for use with kernel-based learning algorithms. The experiments carried out show that this shape information gives results similar to (and sometimes better than) those of appearance-based descriptors. We also investigate how different features can be combined for use in image classification, and show that the proposed shape descriptor together with an appearance descriptor substantially improves classification. Finally, an algorithm is described that automatically detects the regions of interest during training and classification.
This provides a method for suppressing the image background and adds invariance to the position of objects within the images. We show that using shape and appearance over this region of interest, together with random forest classifiers, improves classification and computational time. We compare our results with results from the literature using the same databases as the authors, as well as the same training and classification protocols. All of the innovations introduced are shown to improve the final image classification.
Abstract:
The human visual ability to perceive depth looks like a puzzle. We perceive three-dimensional spatial information quickly and efficiently by using the binocular stereopsis of our eyes and, what is more important, the knowledge of the most common objects that we have acquired through experience. Nowadays, modelling the behaviour of our brain remains a fiction, which is why the huge problem of 3D perception and subsequent interpretation is split into a sequence of easier problems. A great deal of research in robot vision is devoted to obtaining 3D information about the surrounding scene. Most of this research is based on modelling human stereopsis by using two cameras as if they were two eyes. This method is known as stereo vision; it has been widely studied in the past, is being studied at present, and will surely receive much further work in the future. This allows us to affirm that the topic is one of the most interesting in computer vision. The stereo vision principle is based on obtaining the three-dimensional position of an object point from the positions of its projections in both camera image planes. However, before inferring 3D information, the mathematical models of both cameras have to be known. This step is known as camera calibration and is described at length in the thesis. Perhaps the most important problem in stereo vision is determining the pairs of homologous points in the two images, known as the correspondence problem; it is also one of the most difficult problems to solve and is currently being investigated by many researchers. Epipolar geometry allows us to reduce the correspondence problem. An approach to epipolar geometry is described in the thesis. Nevertheless, it does not solve the problem completely, as many considerations have to be taken into account. For example, we have to consider points without correspondence due to a surface occlusion or simply due to a projection outside the camera's field of view.
The thesis focuses on structured light, which has been considered one of the most frequently used techniques for reducing the problems related to stereo vision. Structured light is based on the relationship between a projected light pattern, its projection, and an image sensor. The deformations between the pattern projected onto the scene and the one captured by the camera make it possible to obtain three-dimensional information about the illuminated scene. This technique has been widely used in applications such as 3D object reconstruction, robot navigation, quality control, and so on. Although the projection of regular patterns solves the problem of points without a match, it does not solve the problem of multiple matching, which forces us to use computationally expensive algorithms to search for the correct matches. In recent years, another structured light technique has grown in importance. This technique is based on coding the light projected onto the scene so that it can be used as a tool to obtain a unique match. Each token of light is imaged by the camera, and we have to read its label (decode the pattern) in order to solve the correspondence problem. The advantages and disadvantages of stereo vision compared with structured light, and a survey of coded structured light, are presented and discussed. The work carried out in this thesis has led to a new coded structured light pattern that solves the correspondence problem uniquely and robustly. Uniquely, because each token of light is coded by a different word, which removes the problem of multiple matching. Robustly, because the pattern has been coded using the position of each token of light with respect to both coordinate axes. Algorithms and experimental results are included in the thesis. The reader can see examples of 3D measurement of static objects and of the more complicated measurement of moving objects.
The technique can be used in both cases because the pattern is coded in a single projection shot. It can therefore be used in several robot vision applications. Our interest is focused on the mathematical study of the camera and pattern projector models. We are also interested in how these models can be obtained by calibration, and how they can be used to obtain three-dimensional information from two corresponding points. Furthermore, we have studied structured light and coded structured light, and we have presented a new coded structured light pattern. However, in this thesis we started from the assumption that the corresponding points could be well segmented from the captured image. Computer vision constitutes a huge problem, and a lot of work is being done at all levels of human vision modelling, starting from (a) image acquisition; (b) further image enhancement, filtering and processing; and (c) image segmentation, which involves thresholding, thinning, contour detection, texture and colour analysis, and so on. The interest of this thesis starts at the next step, usually known as depth perception or 3D measurement.
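As an illustration of how a 3D point follows from two corresponding image points, the sketch below handles the simplest case only: two identical, rectified pinhole cameras separated by a horizontal baseline, with image coordinates measured from the principal point. This is a simplifying assumption, not the calibrated camera and projector models studied in the thesis, and the function and parameter names are hypothetical.

```python
def triangulate_rectified(x_left, x_right, y, focal_px, baseline):
    """Recover (X, Y, Z) in the left-camera frame from one correspondence.

    Assumes an idealised rectified stereo pair:
      x_left, x_right : horizontal image coordinates (pixels) of the same
                        scene point, measured from each principal point
      y               : vertical image coordinate (pixels), equal in both views
      focal_px        : focal length in pixels (same for both cameras)
      baseline        : distance between the camera centres (scene units)
    """
    disparity = x_left - x_right
    if disparity <= 0:
        raise ValueError("disparity must be positive for a point in front of the cameras")
    z = focal_px * baseline / disparity   # depth from similar triangles
    x = x_left * z / focal_px             # back-project through the left camera
    y3d = y * z / focal_px
    return (x, y3d, z)

# Usage: a point imaged at x=80 px (left) and x=40 px (right), y=40 px,
# with an assumed focal length of 800 px and a 0.1 m baseline.
point = triangulate_rectified(80.0, 40.0, 40.0, focal_px=800.0, baseline=0.1)
```

In a structured light system the second camera is replaced by the pattern projector, and decoding the pattern supplies the correspondence that this computation takes as given.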