995 results for Visual Matching


Relevance:

60.00% 60.00%

Publisher:

Abstract:

In studies of mirror self-recognition, subjects are usually surreptitiously marked on their head and then presented with a mirror. Scores of studies have established that by 18 to 24 months, children investigate their own head upon seeing the mark in the mirror. Scores of papers have debated what this means. Suggestions range from rich interpretations (e.g., the development of self-awareness) to lean accounts (e.g., the development of proprioceptive-visual matching), and include numerous more moderate proposals (e.g., the development of a concept of one's face). In Study 1, 18- to 24-month-old toddlers were given the standard test and a novel task in which they were marked on their legs rather than on their face. Toddlers performed equivalently on both tasks, suggesting that passing the test does not rely on information specific to facial features. In Study 2, toddlers were surreptitiously slipped into trouser legs that had been fixed to a highchair in advance. Toddlers failed to retrieve the sticker now that their legs looked different from expectations. This finding, together with the findings from a third study which showed that self-recognition in live video feedback develops later than mirror self-recognition, suggests that performance is not solely the result of proprioceptive-visual matching.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

As the popularity of digital video increases, a large number of illegal videos are being generated and published. Video copies are generated by applying various transformations to the original video data. To identify such illegal videos effectively, image features that are invariant to these transformations must be extracted for similarity matching. An image feature can be local or global. Of the two, local features are powerful and have been applied in a wide variety of computer vision applications. This paper focuses on recently proposed local detectors and descriptors that are invariant to a number of image transformations.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevance:

60.00% 60.00%

Publisher:

Abstract:

Objective: There is convincing evidence that phonological, orthographic and semantic processes influence children’s ability to learn to read and spell words. So far, only a few studies have investigated the influence of implicit learning on literacy skills. Children are sensitive to the statistics of their learning environment. Through frequent reading they acquire implicit knowledge about the frequency of letter patterns in written words, and they use this knowledge during reading and spelling. Additionally, semantic connections facilitate the storing of words in memory. Thus, the aim of the intervention study was to implement a word-picture training based on statistical and semantic learning. Furthermore, we aimed to examine the training effects on reading and spelling in comparison to an auditory-visual matching training and a working memory training program. Participants and Methods: One hundred and thirty-two children aged between 8 and 11 years participated in training in three weekly sessions of 12 minutes each over 8 weeks, and completed assessments of reading, spelling, working memory and intelligence before and after training. Results: Overall, the word-picture training and the auditory-visual matching training led to substantial gains in reading and spelling performance in comparison to the working memory training. Although children both with and without learning difficulties improved in reading and spelling after the word-picture training, the program had differential effects for the two groups: children with learning difficulties gained more in spelling than children without learning difficulties, whereas children without learning difficulties gained more in word comprehension. Conclusions: These findings highlight the need for frequent reading training with semantic connections in order to support the acquisition of literacy skills.

Relevance:

60.00% 60.00%

Publisher:

Abstract:

Our previous study with typically developing children demonstrated a positive effect of working memory training on cognitive abilities. Building upon these findings, the aim of this multidisciplinary study is to investigate the effects of training core functions in children with various learning disabilities, such as AD/HD, developmental dyslexia or specific language impairment. In addition to working memory training (BrainTwister), we apply a perceptual training that concentrates on auditory-visual matching (Audilex), as well as an implicit concept learning task. We expect differential improvements in mental capacities, specifically in executive functions (working memory, attention, auditory and visual processing), scholastic abilities (language and mathematical skills), and problem solving. With that, we hope to find further directions for helpful and individually adapted interventions in educational settings. Interested parties are invited to discuss and comment on the design, the research question, and the options for recruiting participants.

Relevance:

40.00% 40.00%

Publisher:

Abstract:

Do capuchin monkeys respond to photos as icons? Do they discriminate photos of capuchin monkeys' faces? Looking for answers to these questions, we trained three capuchin monkeys in simple and conditional discrimination tasks and tested the discriminations when comparison stimuli were partially covered. Three capuchin monkeys experienced in simultaneous simple discrimination and IDMTS were trained with repeated shifts of simple discriminations (RSSD), with four simultaneous choices, and IDMTS (1 s delay, 4 choices) with pictures of known capuchin monkeys' faces. All monkeys discriminated the pictures in both procedures. Performance in probes with partial masks, in which one fourth of the stimulus was hidden, was consistent with baseline level. Errors occurred when a picture similar to the correct one was available among the comparison stimuli, when the covered part was the most distinctive, or when pictures displayed the same monkey. Capuchin monkeys do match pictures of capuchin monkeys' faces to the sample. The monkeys treated different pictures of the same monkey as equivalent, suggesting that they respond to the pictures as icons, although this was not true of pictures of other monkeys. Subsequent studies may bring more evidence that capuchin monkeys treat pictures as depictions of real scenes.

Relevance:

40.00% 40.00%

Publisher:

Abstract:

In this paper, we present a novel coarse-to-fine visual localization approach: contextual visual localization. This approach relies on three elements: (i) a minimal-complexity classifier for performing fast coarse localization (submap classification); (ii) an optimized saliency detector which exploits the visual statistics of the submap; and (iii) a fast view-matching algorithm which filters initial matchings with a structural criterion. The latter algorithm yields fine localization. Our experiments show that these elements have been successfully integrated for solving the global localization problem. Context, that is, the awareness of being in a particular submap, is defined by a supervised classifier tuned for a minimal set of features. Visual context is exploited both for tuning (optimizing) the saliency detection process and for selecting potential matching views in the visual database that are close enough to the query view.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Frame rate upconversion (FRUC) is an important post-processing technique for enhancing the visual quality of low frame rate video. A major recent advance in this area is FRUC based on trilateral filtering, whose novelty derives mainly from combining an edge-based motion estimation block matching criterion with the trilateral filter. However, there is still room for improvement, notably in reducing the size of the uncovered regions in the initial estimated frame, that is, the estimated frame before trilateral filtering. In this context, an improved motion estimation block matching criterion is proposed in which a combined luminance and edge error metric is weighted according to the motion vector components, notably to regularise the motion field. Experimental results confirm that significant improvements are achieved for the final interpolated frames, with average PSNR gains of up to 2.73 dB over recent alternative solutions, for video content with varied motion characteristics.
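
The abstract does not state the exact matching criterion, so the following Python sketch is only one plausible reading of "a combined luminance and edge error metric weighted according to the motion vector components". Everything specific here is an assumption made for illustration: the function names, the crude horizontal-gradient edge map, the weights `alpha` and `lam`, and the multiplicative form of the weighting.

```python
def sad(a, b, x, y, dx, dy, n):
    """Sum of absolute differences between the n-by-n block of `a` at (x, y)
    and the block of `b` displaced by (dx, dy)."""
    return sum(
        abs(a[y + j][x + i] - b[y + dy + j][x + dx + i])
        for j in range(n)
        for i in range(n)
    )


def edge_map(img):
    """Absolute horizontal gradient; an assumed stand-in for the paper's edge detector."""
    return [
        [abs(row[min(i + 1, len(row) - 1)] - row[i]) for i in range(len(row))]
        for row in img
    ]


def best_motion_vector(cur, ref, x, y, n=2, search=1, alpha=1.0, lam=0.1):
    """Full-search block matching for the block of `cur` at (x, y).

    Assumed cost: (luminance SAD + alpha * edge SAD) * (1 + lam * (|dx| + |dy|)),
    so that larger motion vectors are penalised and the field is regularised.
    """
    cur_e, ref_e = edge_map(cur), edge_map(ref)
    h, w = len(ref), len(ref[0])
    candidates = []
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            # Skip displacements that would move the block outside the reference frame.
            if x + dx < 0 or y + dy < 0 or x + dx + n > w or y + dy + n > h:
                continue
            error = sad(cur, ref, x, y, dx, dy, n) + alpha * sad(cur_e, ref_e, x, y, dx, dy, n)
            size = abs(dx) + abs(dy)
            # Ties in cost are broken in favour of the shorter vector.
            candidates.append((error * (1.0 + lam * size), size, dx, dy))
    _, _, dx, dy = min(candidates)
    return dx, dy
```

With this form of the cost, a perfectly flat region yields the zero vector rather than an arbitrary one, which is the kind of regularising behaviour the abstract alludes to.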

Relevance:

30.00% 30.00%

Publisher:

Abstract:

ABSTRACT: First, some brief remarks are made on the scientific, historical and cultural panorama of human vision from three perspectives: the eye (the human eye in its specific phylogenetic position, the basic anatomical and functional element of the visual system, to which the brain belongs), the eyes (essential twin units of the face, with their consensual and conjugate binocular activity), and the gaze (charged with psychological expression, a means of influencing the observer, a signal for behaviour and a source of feeling, consolidated in works of art and in popular superstitions). A cross-sectional descriptive study is then reported, whose goal is to contribute to knowledge of the visual health of children in the Lisbon region and to identify the factors that influence it.
Between October 2005 and August 2006, 649 children under 10 years of age were examined at the paediatric ophthalmology clinic of the SAMS (Serviços de Assistência Médico-Social do Sindicato dos Bancários do Sul e Ilhas). Data were collected on more than 250 primary variables covering most items of the usual ophthalmological examination. Special attention was paid to age, since it plays a decisive role in the main stages of visual system development. For children aged 6 to 7, SAMS and school results are presented side by side. Given the large volume of numerical data, the statistical significance of differences between subgroups was tested frequently. Some of the study's results (mostly from the SAMS group): among children aged 6 to 7, 71.1% (SAMS) and 91.5% (schools) had not had an ophthalmological examination before the age of 4. The overall frequency of myopic disorders was 9.4% and of hypermetropic disorders 25.3%, both varying markedly with age; convergent strabismus, 3.9%; amblyopia, 2.6% (13/491 children aged 4 years or older), more frequent in girls, in children first examined after the age of 4, and in those whose parents did not comply with the prescribed therapy. Specific objectives dealt with visual acuity and ocular refraction. The comparison of automatic refractometry without and with cycloplegia showed that visual acuity testing alone is not sufficient for a correct diagnosis.
Analysis of the ophthalmological family history demonstrated the importance of knowing it and revealed, among others, the following relationships. Children with a family history of myopic disorders more frequently have a diagnosis of myopic disorders and negative refraction, and show a higher rate of quantitative diagnosis/refraction matching for myopic disorders. These children generally show the inverse characteristics with regard to hypermetropic disorders. Children with a family history of hypermetropic disorders more frequently have a diagnosis of hypermetropic disorders. Children with a family history of strabismus more frequently have a diagnosis of manifest convergent strabismus and of esodeviations as a whole. Children with a family history of astigmatism more frequently have a diagnosis of astigmatism. Ophthalmological profiles of children are drawn that give a synoptic view of a set of visual health parameters. Data on parents' compliance with the prescribed therapy and attitudes towards wearing glasses, as well as data on the child's classroom behaviour and learning difficulties, were generally too sparse to allow conclusions, although they show indications worth investigating in future.
In parallel, orthoptists and nurses screened for visual acuity <0.8 and for ocular motility disorders among 520 first-year pupils (2005/2006) of public primary schools in the city of Lisbon. Of these children, 101 were examined by the author in her office, some referred from the screening and the others as controls. For visual acuity, the predictive value of a negative test was 91%, but that of a positive test was only 67% (33% false positives, and consequently a high rate of over-referral). The screening performed by orthoptists was of lower quality than that performed by nurses. Overall, the screening was not of acceptable quality. A survey of physicians' and nurses' knowledge, attitudes and practices regarding paediatric ophthalmological care was carried out in health centres. Results are discussed, conclusions are drawn, and recommendations are made that may contribute to better visual health in children.
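
The predictive values quoted for the screening follow from standard definitions: the positive predictive value is the share of positive tests that are true positives, and the negative predictive value is the share of negative tests that are true negatives. A minimal Python sketch follows. The counts below are invented purely for illustration and are not the study's data, which the abstract does not report; the function name is likewise an assumption.

```python
def predictive_values(tp, fp, tn, fn):
    """Return (PPV, NPV) from the four cells of a screening confusion matrix."""
    ppv = tp / (tp + fp)  # positive tests that are truly positive
    npv = tn / (tn + fn)  # negative tests that are truly negative
    return ppv, npv


# Hypothetical counts, chosen only so that the rounded values resemble those quoted.
ppv, npv = predictive_values(tp=20, fp=10, tn=91, fn=9)
print(f"PPV = {ppv:.0%}, NPV = {npv:.0%}")  # PPV = 67%, NPV = 91%
```

The "33% false positives" figure in the abstract is the complement of the positive predictive value: one in three children referred by the screening did not have reduced acuity on examination.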

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Underground scenarios are among the most challenging environments for accurate and precise 3D mapping, where hostile conditions such as the absence of Global Positioning Systems, extreme lighting variations and geometrically smooth surfaces may be expected. So far, the state-of-the-art methods in underground modelling remain restricted to environments in which pronounced geometric features are abundant. This limitation is a consequence of the scan matching algorithms used to solve the localization and registration problems. This paper contributes to the expansion of the modelling capabilities to structures characterized by uniform geometry and smooth surfaces, as is the case of road and train tunnels. To achieve that, we combine some state-of-the-art techniques from mobile robotics and propose a method for 6DOF platform positioning in such scenarios, which is later used for the environment modelling. A visual monocular Simultaneous Localization and Mapping (MonoSLAM) approach based on the Extended Kalman Filter (EKF), complemented by the introduction of inertial measurements in the prediction step, allows our system to localize itself over long distances, using exclusively sensors carried on board a mobile platform. By feeding the Extended Kalman Filter with inertial data we were able to overcome the major problem of MonoSLAM implementations, known as scale factor ambiguity. Despite extreme lighting variations, reliable visual features were extracted with the SIFT algorithm and inserted directly into the EKF mechanism according to the Inverse Depth Parametrization. Wrong frame-to-frame feature matches were rejected with 1-Point RANSAC (Random Sample Consensus). The developed method was tested on a dataset acquired inside a road tunnel, and the navigation results were compared with a ground truth obtained by post-processing measurements from a high-grade Inertial Navigation System and L1/L2 RTK-GPS measurements acquired outside the tunnel. 
Results from the localization strategy are presented and analyzed.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Localization, which is the ability of a mobile robot to estimate its position within its environment, is a key capability for autonomous operation of any mobile robot. This thesis presents a system for indoor coarse and global localization of a mobile robot based on visual information. The system is based on image matching and uses SIFT features as natural landmarks. Features extracted from training images are stored in a database for later use in localization. During localization, an image of the scene is captured using the on-board camera of the robot, features are extracted from the image, and the best match is searched for in the database. Feature matching is done using the k-d tree algorithm. Experimental results showed that localization accuracy increases with the number of training features in the database, while a larger number of features tended to increase the computation time. For some parts of the environment the error rate was relatively high because features taken from those places were strongly correlated with features from elsewhere in the environment.
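
The matching step described above, searching a feature database with a k-d tree, can be sketched in a few lines of Python. This is an illustrative toy rather than the thesis's implementation: the 2-D vectors stand in for 128-dimensional SIFT descriptors, the place labels and function names are invented, the search is exact rather than approximate, and the majority vote in `localize` is an assumed way of turning per-feature matches into a coarse position.

```python
from collections import Counter
from math import dist


def build_kdtree(points, depth=0):
    """Recursively build a k-d tree from (vector, label) pairs."""
    if not points:
        return None
    axis = depth % len(points[0][0])
    points = sorted(points, key=lambda p: p[0][axis])
    mid = len(points) // 2
    return {
        "point": points[mid],
        "axis": axis,
        "left": build_kdtree(points[:mid], depth + 1),
        "right": build_kdtree(points[mid + 1:], depth + 1),
    }


def nearest(node, query, best=None):
    """Return the stored (vector, label) pair closest to `query`."""
    if node is None:
        return best
    if best is None or dist(query, node["point"][0]) < dist(query, best[0]):
        best = node["point"]
    diff = query[node["axis"]] - node["point"][0][node["axis"]]
    near, far = (node["left"], node["right"]) if diff < 0 else (node["right"], node["left"])
    best = nearest(near, query, best)
    # Search the far side only if the splitting plane is closer than the best match so far.
    if abs(diff) < dist(query, best[0]):
        best = nearest(far, query, best)
    return best


def localize(tree, query_features):
    """Each query feature votes for the place of its nearest database feature."""
    votes = Counter(nearest(tree, q)[1] for q in query_features)
    return votes.most_common(1)[0][0]


# Hypothetical training database: one toy descriptor per place.
database = [((0, 0), "corridor"), ((1, 0), "lab"), ((0, 1), "kitchen"), ((1, 1), "office")]
tree = build_kdtree(database)
print(localize(tree, [(0.9, 0.1), (1.1, -0.1), (0.1, 0.9)]))  # lab
```

The pruning test in `nearest` is what makes the tree faster than a linear scan on average. A larger database still costs more to build and search, consistent with the trade-off between accuracy and computation time reported in the abstract.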

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Local features are used in many computer vision tasks, including visual object categorization, content-based image retrieval and object recognition, to mention a few. Local features are points, blobs or regions in images that are extracted using a local feature detector. To make use of the extracted local features, the localized interest points are described using a local feature descriptor. A descriptor histogram vector is a compact representation of an image and can be used for searching and matching images in databases. In this thesis, the performance of local feature detectors and descriptors is evaluated for the object class detection task. Features are extracted from image samples belonging to several object classes. Matching features are then searched for using random image pairs from the same class. The goal of this thesis is to find out which detector and descriptor methods are best for this task in terms of detector repeatability and descriptor matching rate.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

The large and growing number of digital images is making manual image search laborious. Only a fraction of the images contain metadata that can be used to search for a particular type of image. Thus, the main research question of this thesis is whether it is possible to learn visual object categories directly from images. Computers process images as long lists of pixels that have no clear connection to the high-level semantics that could be used in image search. Various methods have been introduced in the literature to extract low-level image features, along with approaches to connect these low-level features with high-level semantics. One of these approaches, Bag-of-Features, is studied in the thesis. In the Bag-of-Features approach, the images are described using a visual codebook. The codebook is built from the descriptions of the image patches using clustering. The images are described by matching descriptions of image patches with the visual codebook and computing the number of matches for each code. In this thesis, unsupervised visual object categorisation using the Bag-of-Features approach is studied. The goal is to find groups of similar images, e.g., images that contain an object from the same category. The standard Bag-of-Features approach is improved by using spatial information and visual saliency. It was found that the performance of visual object categorisation can be improved by using spatial information of local features to verify the matches. However, this process is computationally heavy, and thus the number of images must be limited in the spatial matching, for example, by using the Bag-of-Features method as in this study. Different approaches for saliency detection are studied and a new method based on the Hessian-Affine local feature detector is proposed. The new method achieves results comparable with the current state of the art. 
The visual object categorisation performance was improved by using foreground segmentation based on saliency information, especially when the background could be considered as clutter.
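
The description step of the Bag-of-Features approach, matching each patch description to the codebook and counting matches per code, can be illustrated with a short Python sketch. It assumes the codebook has already been built by clustering and matches by Euclidean nearest neighbour. The function name and the two-dimensional toy vectors are hypothetical, and none of the spatial verification or saliency steps from the thesis is included.

```python
from math import dist


def bof_histogram(descriptors, codebook):
    """Count, for each code, how many patch descriptors have it as their nearest code."""
    counts = [0] * len(codebook)
    for d in descriptors:
        nearest_code = min(range(len(codebook)), key=lambda k: dist(d, codebook[k]))
        counts[nearest_code] += 1
    return counts


# Hypothetical codebook of two codes and three patch descriptors from one image.
codebook = [(0.0, 0.0), (1.0, 1.0)]
descriptors = [(0.1, 0.0), (0.9, 1.0), (0.8, 0.9)]
print(bof_histogram(descriptors, codebook))  # [1, 2]
```

Images can then be grouped by comparing these fixed-length histograms, regardless of how many patches each image produced.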

Relevance:

30.00% 30.00%

Publisher:

Abstract:

We present a statistical image-based shape + structure model for Bayesian visual hull reconstruction and 3D structure inference. The 3D shape of a class of objects is represented by sets of contours from silhouette views simultaneously observed from multiple calibrated cameras. Bayesian reconstructions of new shapes are then estimated using a prior density constructed with a mixture model and probabilistic principal components analysis. We show how the use of a class-specific prior in a visual hull reconstruction can reduce the effect of segmentation errors from the silhouette extraction process. The proposed method is applied to a data set of pedestrian images, and improvements in the approximate 3D models under various noise conditions are shown. We further augment the shape model to incorporate structural features of interest; unknown structural parameters for a novel set of contours are then inferred via the Bayesian reconstruction process. Model matching and parameter inference are done entirely in the image domain and require no explicit 3D construction. Our shape model enables accurate estimation of structure despite segmentation errors or missing views in the input silhouettes, and works even with only a single input view. Using a data set of thousands of pedestrian images generated from a synthetic model, we can accurately infer the 3D locations of 19 joints on the body based on observed silhouette contours from real images.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

This thesis presents three important results in visual object recognition based on shape. (1) A new algorithm (RAST; Recognition by Adaptive Subdivisions of Transformation Space) is presented that has lower average-case complexity than any known recognition algorithm. (2) It is shown, both theoretically and empirically, that representing 3D objects as collections of 2D views (the "View-Based Approximation") is feasible and affects the reliability of 3D recognition systems no more than other commonly made approximations. (3) The problem of recognition in cluttered scenes is considered from a Bayesian perspective; the commonly used "bounded-error error measure" is demonstrated to correspond to an independence assumption. It is shown that by better modeling the statistical properties of real scenes, objects can be recognized more reliably.