861 resultados para Bag-of-visual Words
Resumo:
Asperger Syndrome (AS) belongs to autism spectrum disorders where both verbal and non-verbal communication difficulties are at the core of the impairment. Social communication requires a complex use of affective, linguistic-cognitive and perceptual processes. In the four studies included in the current thesis, some of the linguistic and perceptual factors that are important for face-to-face communication were studied using behavioural methods. In all four studies the results obtained from individuals with AS were compared with typically developed age, gender and IQ matched controls. First, the language skills of school-aged children were characterized in detail with standardized tests that measured different aspects of receptive and expressive language (Study I). The children with AS were found to be worse than the controls in following complex verbal instructions. Next, the visual perception of facial expressions of emotion with varying degrees of visual detail was examined (Study II). Adults with AS were found to have impaired recognition of facial expressions on the basis of very low spatial frequencies which are important for processing global information. Following that, multisensory perception was investigated by looking at audiovisual speech perception (Studies III and IV). Adults with AS were found to perceive audiovisual speech qualitatively differently from typically developed adults, although both groups were equally accurate in recognizing auditory and visual speech presented alone. Finally, the effect of attention on audiovisual speech perception was studied by registering eye gaze behaviour (Study III) and by studying the voluntary control of visual attention (Study IV). The groups did not differ in eye gaze behaviour or in the voluntary control of visual attention. The results of the study series demonstrate that many factors underpinning face-to-face social communication are atypical in AS. In contrast with previous assumptions about intact language abilities, the current results show that children with AS have difficulties in understanding complex verbal instructions. Furthermore, the study makes clear that deviations in the perception of global features in faces expressing emotions as well as in the multisensory perception of speech are likely to harm face-to-face social communication.
Resumo:
Our everyday visual experience frequently involves searching for objects in clutter. Why are some searches easy and others hard? It is generally believed that the time taken to find a target increases as it becomes similar to its surrounding distractors. Here, I show that while this is qualitatively true, the exact relationship is in fact not linear. In a simple search experiment, when subjects searched for a bar differing in orientation from its distractors, search time was inversely proportional to the angular difference in orientation. Thus, rather than taking search reaction time (RT) to be a measure of target-distractor similarity, we can literally turn search time on its head (i.e. take its reciprocal 1/RT) to obtain a measure of search dissimilarity that varies linearly over a large range of target-distractor differences. I show that this dissimilarity measure has the properties of a distance metric, and report two interesting insights come from this measure: First, for a large number of searches, search asymmetries are relatively rare and when they do occur, differ by a fixed distance. Second, search distances can be used to elucidate object representations that underlie search - for example, these representations are roughly invariant to three-dimensional view. Finally, search distance has a straightforward interpretation in the context of accumulator models of search, where it is proportional to the discriminative signal that is integrated to produce a response. This is consistent with recent studies that have linked this distance to neuronal discriminability in visual cortex. Thus, while search time remains the more direct measure of visual search, its reciprocal also has the potential for interesting and novel insights. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
We propose a new paradigm for displaying comments: showing comments alongside parts of the article they correspond to. We evaluate the effectiveness of various approaches for this task and show that a combination of bag of words and topic models performs the best.
Resumo:
There are many popular models available for classification of documents like Naïve Bayes Classifier, k-Nearest Neighbors and Support Vector Machine. In all these cases, the representation is based on the “Bag of words” model. This model doesn't capture the actual semantic meaning of a word in a particular document. Semantics are better captured by proximity of words and their occurrence in the document. We propose a new “Bag of Phrases” model to capture this discriminative power of phrases for text classification. We present a novel algorithm to extract phrases from the corpus using the well known topic model, Latent Dirichlet Allocation(LDA), and to integrate them in vector space model for classification. Experiments show a better performance of classifiers with the new Bag of Phrases model against related representation models.
Resumo:
In this paper, we have proposed a simple and effective approach to classify H.264 compressed videos, by capturing orientation information from the motion vectors. Our major contribution involves computing Histogram of Oriented Motion Vectors (HOMV) for overlapping hierarchical Space-Time cubes. The Space-Time cubes selected are partially overlapped. HOMV is found to be very effective to define the motion characteristics of these cubes. We then use Bag of Features (B OF) approach to define the video as histogram of HOMV keywords, obtained using k-means clustering. The video feature, thus computed, is found to be very effective in classifying videos. We demonstrate our results with experiments on two large publicly available video database.
Resumo:
Objective: The aim of this study is to validate the applicability of the PolyVinyliDene Fluoride (PVDF) nasal sensor to assess the nasal airflow, in healthy subjects and patients with nasal obstruction and to correlate the results with the score of Visual Analogue Scale (VAS). Methods: PVDF nasal sensor and VAS measurements were carried out in 50 subjects (25-healthy subjects and 25 patients). The VAS score of nasal obstruction and peak-to-peak amplitude (Vp-p) of nasal cycle measured by PVDF nasal sensors were analyzed for right nostril (RN) and left nostril (LN) in both the groups. Spearman's rho correlation was calculated. The relationship between PVDF nasal sensor measurements and severity of nasal obstruction (VAS score) were assessed by ANOVA. Results: In healthy group, the measurement of nasal airflow by PVDF nasal sensor for RN and LN were found to be 51.14 +/- 5.87% and 48.85 +/- 5.87%, respectively. In patient group, PVDF nasal sensor indicated lesser nasal airflow in the blocked nostrils (RN: 23.33 +/- 10.54% and LN: 32.24 +/- 11.54%). Moderate correlation was observed in healthy group (r = 0.710, p < 0.001 for RN and r = 0.651, p < 0.001 for LN), and moderate to strong correlation in patient group (r = 0.751, p < 0.01 for RN and r = 0.885, p < 0.0001 for LN). Conclusion: PVDF nasal sensor method is a newly developed technique for measuring the nasal airflow. Moderate to strong correlation was observed between PVDF nasal sensor data and VAS scores for nasal obstruction. In our present study, PVDF nasal sensor technique successfully differentiated between healthy subjects and patients with nasal obstruction. Additionally, it can also assess severity of nasal obstruction in comparison with VAS. Thus, we propose that the PVDF nasal sensor technique could be used as a new diagnostic method to evaluate nasal obstruction in routine clinical practice. (C) 2015 Elsevier Inc. All rights reserved.
Resumo:
In optical character recognition of very old books, the recognition accuracy drops mainly due to the merging or breaking of characters. In this paper, we propose the first algorithm to segment merged Kannada characters by using a hypothesis to select the positions to be cut. This method searches for the best possible positions to segment, by taking into account the support vector machine classifier's recognition score and the validity of the aspect ratio (width to height ratio) of the segments between every pair of cut positions. The hypothesis to select the cut position is based on the fact that a concave surface exists above and below the touching portion. These concave surfaces are noted down by tracing the valleys in the top contour of the image and similarly doing it for the image rotated upside-down. The cut positions are then derived as closely matching valleys of the original and the rotated images. Our proposed segmentation algorithm works well for different font styles, shapes and sizes better than the existing vertical projection profile based segmentation. The proposed algorithm has been tested on 1125 different word images, each containing multiple merged characters, from an old Kannada book and 89.6% correct segmentation is achieved and the character recognition accuracy of merged words is 91.2%. A few points of merge are still missed due to the absence of a matched valley due to the specific shapes of the particular characters meeting at the merges.
Resumo:
Among the human factors that influence safe driving, visual skills of the driver can be considered fundamental. This study mainly focuses on investigating the effect of visual functions of drivers in India on their road crash involvement. Experiments were conducted to assess vision functions of Indian licensed drivers belonging to various organizations, age groups and driving experience. The test results were further related to the crash involvement histories of drivers through statistical tools. A generalized linear model was developed to ascertain the influence of these traits on propensity of crash involvement. Among the sampled drivers, colour vision, vertical field of vision, depth perception, contrast sensitivity, acuity and phoria were found to influence their crash involvement rates. In India, there are no efficient standards and testing methods to assess the visual capabilities of drivers during their licensing process and this study highlights the need for the same.
Resumo:
Resumen: Empleando la teoría de la “estructura comunitaria”, un muestreo de diarios principales en 28 ciudades grandes en Estados Unidos examina la cobertura del tema “El manejo de contaminación de agua y acceso a agua potable”. Mediante el análisis de todos los artículos de más de 250 palabras publicados a través de diez años entre 01/01/2001 y 01/01/2011 (339 artículos), se compararon sistemáticamente características comunitarias y el “Vector Mediático” de Pollock (combinando en un valor dos medidas de contenido: la “prominencia” de un artículo en un periódico con la orientación o tono). Cobertura “favorable”, que apoya la mayor ayuda gubernamental para mejorar el abastecimiento de agua potable, fue vinculada con medidas de “los interesados”, por ejemplo, con el porcentaje de hispanos (r de Pearson = .349, p = .04). El análisis de las medidas y su regresión reveló dos medidas significativas asociadas con apoyo para manejo gubernamental por agua potable: porcentaje de hispanos (12.2% de la varianza), y con porcentaje de ciudadanos de 18-24 años, 16.7%. Inesperadamente, la cobertura de manejo gubernamental para mejorar las existencias de agua potable no fue vinculado ni con medidas de “vulnerabilidad” (pobreza, desempleo) ni con medidas de “estabilidad” (educación, ingreso).
Resumo:
903 páginas, bibliografía en páginas 854-895, glosario en páginas 896-903
Resumo:
Digital text benefits a wide range of learners, particularly disabled learners. For those with a visual impairment, it can be magnified or read out loud using synthetic speech. It can be navigated by heading and subheading levels, and text colours and backgrounds can be altered, both useful features for dyslexic learners. Definitions of unfamiliar words can be checked without leaving the text.
Resumo:
This work deals with two related areas: processing of visual information in the central nervous system, and the application of computer systems to research in neurophysiology.
Certain classes of interneurons in the brain and optic lobes of the blowfly Calliphora phaenicia were previously shown to be sensitive to the direction of motion of visual stimuli. These units were identified by visual field, preferred direction of motion, and anatomical location from which recorded. The present work is addressed to the questions: (1) is there interaction between pairs of these units, and (2) if such relationships can be found, what is their nature. To answer these questions, it is essential to record from two or more units simultaneously, and to use more than a single recording electrode if recording points are to be chosen independently. Accordingly, such techniques were developed and are described.
One must also have practical, convenient means for analyzing the large volumes of data so obtained. It is shown that use of an appropriately designed computer system is a profitable approach to this problem. Both hardware and software requirements for a suitable system are discussed and an approach to computer-aided data analysis developed. A description is given of members of a collection of application programs developed for analysis of neuro-physiological data and operated in the environment of and with support from an appropriate computer system. In particular, techniques developed for classification of multiple units recorded on the same electrode are illustrated as are methods for convenient graphical manipulation of data via a computer-driven display.
By means of multiple electrode techniques and the computer-aided data acquisition and analysis system, the path followed by one of the motion detection units was traced from open optic lobe through the brain and into the opposite lobe. It is further shown that this unit and its mirror image in the opposite lobe have a mutually inhibitory relationship. This relationship is investigated. The existence of interaction between other pairs of units is also shown. For pairs of units responding to motion in the same direction, the relationship is of an excitatory nature; for those responding to motion in opposed directions, it is inhibitory.
Experience gained from use of the computer system is discussed and a critical review of the current system is given. The most useful features of the system were found to be the fast response, the ability to go from one analysis technique to another rapidly and conveniently, and the interactive nature of the display system. The shortcomings of the system were problems in real-time use and the programming barrier—the fact that building new analysis techniques requires a high degree of programming knowledge and skill. It is concluded that computer system of the kind discussed will play an increasingly important role in studies of the central nervous system.
Resumo:
To understand mechanisms underlying laser-induced damage of BK7 and fused silica, we calculate the temperature field of the substrates with CO2 laser irradiating at a given laser power and beam radius. We find that the two glasses show different thermal behaviors. A model is developed for estimating the time t to heat the surface of the substrates up to a particular temperature T with cw CO2 laser irradiation. We calculate theoretically the duration t that the samples are irradiated, from the beginning to visual catastrophic damage, with the assumption of damage threshold determined by the critical temperature. The duration t that the samples are irradiated, from the beginning to visual catastrophic damage, is investigated experimentally as well. Here we take the melting point or softening point as the critical temperature, given the thermomechanical coupling properties, which is enough to cause damage for BK7. Damage features are characterized by the sound of visual cracks. Finally, we calculate stresses induced by laser heating. The analysis of stress indicates that the damage of BK7 is due to the stresses induced by laser heating. (c) 2005 Society of Photo-Optical Instrumentation Engineers.
Resumo:
To understand mechanisms underlying laser-induced damage of BK7 and fused silica, we calculate the temperature field of the substrates with CO2 laser irradiating at a given laser power and beam radius. We find that the two glasses show different thermal behaviors. A model is developed for estimating the time t to heat the surface of the substrates up to a particular temperature T with cw CO2 laser irradiation. We calculate theoretically the duration t that the samples are irradiated, from the beginning to visual catastrophic damage, with the assumption of damage threshold determined by the critical temperature. The duration t that the samples are irradiated, from the beginning to visual catastrophic damage, is investigated experimentally as well. Here we take the melting point or softening point as the critical temperature, given the thermomechanical coupling properties, which is enough to cause damage for BK7. Damage features are characterized by the sound of visual cracks. Finally, we calculate stresses induced by laser heating. The analysis of stress indicates that the damage of BK7 is due to the stresses induced by laser heating. (c) 2005 Society of Photo-Optical Instrumentation Engineers.
Resumo:
Parte da hipótese que a obra poética, artística ou não, tem força para além da mediação da palavra, ou seja, da afirmação de sentidos que obliterariam a eloquência da presença. Busca discorrer sobre a teoria da presença e sua importância na interlocução com a produção poética nas artes visuais contemporâneas. Aponta a utilidade dessa argumentação como perspectiva para problematizar a Cultura Visual e defender o investimento na elucidação do universo das imagens visuais como elemento de formação humana em sintonia com as questões da alteridade e com os tempos de hoje. Entende que o estudo equalizador entre as imagens visuais e as obras de arte visual favorece a autonomia dos indivíduos e o melhor aproveitamento do mundo das artes com menor risco de sujeição às hegemonias culturais