12 resultados para Visual perception
em Chinese Academy of Sciences Institutional Repositories Grid Portal
Resumo:
It is important for practical application to design an effective and efficient metric for video quality. The most reliable way is by subjective evaluation. Thus, to design an objective metric by simulating human visual system (HVS) is quite reasonable and available. In this paper, the video quality assessment metric based on visual perception is proposed. Three-dimensional wavelet is utilized to decompose video and then extract features to mimic the multichannel structure of HVS. Spatio-temporal contrast sensitivity function (S-T CSF) is employed to weight coefficient obtained by three-dimensional wavelet to simulate nonlinearity feature of the human eyes. Perceptual threshold is exploited to obtain visual sensitive coefficients after S-T CSF filtered. Visual sensitive coefficients are normalized representation and then visual sensitive errors are calculated between reference and distorted video. Finally, temporal perceptual mechanism is applied to count values of video quality for reducing computational cost. Experimental results prove the proposed method outperforms the most existing methods and is comparable to LHS and PVQM.
Resumo:
According to Chen's theory, topological differences are perceived faster than feature differences in early visual perception. We hypothesized that topological perception is caused by the sensitivity in discriminating figures with and without "holes". An E
Resumo:
Global information is considered the primitive of visual perception in Gestalt psychology. Further, L. Chen ( 2005) proposed a new theory of topological visual perception. According to this theory, the perception of topological difference is faster than o
Resumo:
随着机器人应用范围的不断扩展,机器人所面临的工作环境也越来越复杂,多数是未知的、动态的和非结构化的。通过对基于行为的机器人控制技术的研究,设计了一种用于完成多目标任务的移动机器人行为控制系统。将基于行为的控制技术融合进模糊控制的思想中,使移动机器人的行为通过运用模糊控制和基于优先度的行为决策来实现,并且通过视觉信息使机器人能够完成面向目标的任务。
Resumo:
在JPEG静止图象压缩的基础上,设计了一种扩充的自适应量化器.利用人眼的视觉特征,通过分析MCU块的局部视觉活动性,以MCU活动性函数确定量化因子,并引入亮度掩盖算子调节量化参量.实验结果表明,本文所设计的自适应量化器能减少图象编码主观失真,改善图象质量,获得更好的压缩效果
Resumo:
IQ Structure, Psycholinguistic and Visual-motor Abilities Study on Children Learning Disability TONG Fang Directed by professor Zhu Liqi (Developmental and educational psychology) ABSTRACT Objective To comprehensive analyze the IQ structures, and relationships among IQ, psychometric characteristics and visual-motor integration on children disability. At same time, to probe into the family factors that influenced IQ, psycholinguistic abilities and behavior of LD children. Method (1) Downloading the papers on children learning disability from www.cqvip.com and www.wanfangdata.com, in which, the articles were collected by key words from 1985 to 2005. To conduct meta-analysis on IQ construction, compare the case group and the control group, including full IQ, verbal and practice IQ. (2) Designed with model compared and self-compared, 59 diagnosed learning disability children, tested themes with WISC, ITPA and Berry’s VMI. WISC included 10 items, 5 of which subtotal to verbal and practice IQ respectively. IPTA included 10 items, too, 5 process of which subtotal to auditory and visual perception. The first 3 items shared representation level, the other 2 of that shared automatic level.VMI had one score. Analyzed factors and levels with description and Pearson Correlation. To probe to linguistic internal alternately functions of LD children, and compare the scores of groups in different IQ. (3) Analyzed the perspective questionnaire filled by parents. Early development facts compared with model groups. Factors relationships analyzed with Kendall correlation, KOM and Bartlett’s test of sphericity, Promax Rotation. Results: (1) There have been 319 papers related with LD, in which 36 with IQ and 14 valid reports have been analyzed by Meta. FIQ’s 95%CI (confidence interval) is 2.418 ~ 0.172, VIQ between the difficulty and non- difficulty group. C-WISC-R reports were 10 papers, of which, 95%CI of FIQ is 2.424 ~ 0.676, of VIQ is 2.314 ~ 1.196, of PIQ is 2.176 ~ 0.176. The VIQ comparing the PIQ, 95%CI is 1.1 ~ -0.07 in difficulty group and 0.5 ~ -0.0046 in non-difficult group. Nevertheless, in the other 4 tests, FIQ’s 95%CI is 2.00 ~ -0.818 between LD and NLD. (2) Children psycholinguistic abilities had strong relation with Berry’s VMI test excluding auditory reception, and with perceptive factor of intelligence excluding verbal expression. Auditory reception and visual closure had strong relation with FIQ and PIQ. Grammatic closure, visual association and manual expression had strong relation with concept factor. The representational and automatic levels are depended on integration of auditory and visual procession. Lower verbal expression (VE) let to lower expression process and low scores on representational level. Lower visual sequential memory (VSM) let to lower memory process and influenced automatic level. Groups compared by IQ 90 show that LD children with under IQ 90 had lower scores on items of IPTA than with up IQ 90 excluded verbal expression. It was proved that IQ administrated the linguistic ability. Nevertheless, general abilities deficiency didn’t show influencing on the types of the perceptive delay. There was mutual function among linguistic ability on LD children. Auditory and visual level are overlapped each other. Not only show higher Decoding and lower Encoding on Auditory perception, lower Decoding and higher Encoding on Visual perception, in representation, but also higher Sequential remember, lower Closure on Audition, and lower Sequential member, higher Closure on Vision, in Automation. Nevertheless, there was no different between Representational and Automatic level, which may be the relationship of parallel or evolution. (3) Major family factors were father’s education, occupation. Lower auditory perception related to unconcerned, lower visual perception related to premature delivery and written slowly. Threatened–abortion, childbirth-suffocated were known as influencing children’s IQ and later linguistic abilities. It wasn’t shown that dosage relationship with the types of perceptive delay. Conclusion: (1) The FIQ, VIQ and PIQ of Children with LD is lower than that of NLD group. There is no significantly different between VIQ and PIQ in LD and NLD groups. (2) The objectives of ITPA and WISC tests are differently. The psycholinguistic abilities had strong relation with perceptive factor and VMI. Some facts of IPTA related with FIQ. IQ had strong administration on linguistic abilities. There was mutual function among linguistic internal abilities. (3) Family facts on IQ and psycholinguistic abilities were Father’s education, abnormal pregnant and abortion. It would be pre-show development delay in early period.
Resumo:
One of the great puzzles in the psychology of visual perception is that the visual world appears to be a coherent whole despite our viewing it through temporally discontinuous series of eye fixations. The investigators attempted to explain this puzzle from the perspective of sequential visual information integration. In recent years, investigators hypothesized that information maintained in the visual short-term memory (VSTM) could become visual mental images gradually during time delay in visual buffer and integrated with information perceived currently. Some elementary studies had been carried out to investigate the integration between VSTM and visual percepts, but further research is required to account for several questions on the spatial-temporal characteristics, information representation and mechanism of integrating sequential visual information. Based on the theory of similarity between visual mental image and visual perception, this research (including three studies) employed the temporal integration paradigm and empty cell localization task to further explore the spatial-temporal characteristics, information representation and mechanism of integrating sequential visual information (sequential arrays). The purpose of study 1 was to further explore the temporal characteristics of sequential visual information integration by examining the effects of encoding time of sequential stimuli on the integration of sequential visual information. The purpose of study 2 was to further explore the spatial characteristics of sequential visual information integration by investigating the effects of spatial characteristics change on the integration of sequential visual information. The purpose of study 3 was to explore the information representation of information maintained in the VSTM and integration mechanism in the process of integrating sequential visual information by employing the behavioral experiments and eye tracking technology. The results indicated that: (1) Sequential arrays could be integrated without strategic instruction. Increasing the duration of the first array could cause improvement in performance and increasing the duration of the second array could not improve the performance. Temporal correlation model was not fit to explain the sequential array integration under long-ISI conditions. (2) Stimuli complexity influenced not only the overall performance of sequential arrays but also the values of ISI at asymptotic level of performance. Sequential arrays still could be integrated when the spatial characteristics of sequential arrays changed. During ISI, constructing and manipulating of visual mental image of array 1 were two separate processing phases. (3) During integrating sequential arrays, people represented the pattern constituted by the objects' image maintained in the VSTM and the topological characteristics of the objects' image had some impact on fixation location. The image-perception integration hypothesis was supported when the number of dots in array 1 was less than empty cells, and the convert-and-compare hypothesis was supported when the number of the dot in array 1 was equal to or more than empty cells. These findings not only contribute to make people understand the process of sequential visual information integration better, but also have significant practical application in the design of visual interface.
Resumo:
In the history of psychology research, more attention had been focused on the relation between local processing and global processing. For the global information and the local information, which is processed earlier? And which is processed faster? Precedence of the global over the local level in visual perception has been well established by Navon with compound stimuli, and Navon’s original study gave rise to many publications, including replications, generalization to other kinds of stimuli (nonverbal material, digits), populations (infants, children, brain-damaged subjects), and tasks (lateral visual hemifield presentation, copy drawing, memory recognition, and recall), and triggered some debate about the conditions in which global precedence is and is not observed (number, size, sparsity, and goodness of the stimuli, exposure duration, etc.). However, whether there is a global advantage or precedence in other cognitive processes was less tested. Most researches had suggested that there was a functional equivalency between visual perception and visual image processing. However, it’s still unknown whether there will be a global advantage on mental rotation. In the present study, we combined the mental rotation task with the compound stimuli to explore whether the global or local advantage also existed at the mental imagery transformation stages. In two pilot studies, the perceptual global precedence was found to be present in a normal/mirror-image judgment task when the stimuli exposure time was short; while the stimuli exposure time was prolonged (stimuli kept available till subjects’ response) the perceptual global precedence was showed to be eliminated. In all of the subsequent experiments, stimili would be presented till subjects’ response. Then mental rotation was added in normal/mirror-image judgment (some of the stimuli were rotated to certain angles from upright) in normal experiments, experiment 1 and 2 observed a global advantage on mental rotation both with a focused-attention design (Experiment 1) and divided-attention design (Experiment 2). Subjects’ reaction times were increased with rotation angles, and the accuracy was decreased with rotation angles, suggesting that subject need a mental rotation to make a normal/mirror judgment. The most important results were that subjects’ response to global rotation was faster than that to local rotation. The analysis of slope of rotation further indicated that, to some extend, the speed of global rotation was faster than that of local rotation. These results suggest a global advantage on mental rotation. Experiment 3 took advantage of the high temporal resolution of event-related potentials to explore the temporal pattern of global advantage on mental rotation. Event-related potential results indicated the parietal P300 amplitude was inversely related to the character orientation, and the local rotation task delayed the onset of the mental-rotation-related negativity at parietal electrodes. None clear effect was found for occipital N150. All these results suggested that the global rotation was not only processed faster than local rotation, but also occurred earlier than local rotation. Experiments 4 and 5 took the effect size of global advantage as the main dependent variable, and visual angle and exposure duration of the stimuli as independent variables, to examine the relationship between perceptual global precedence and global advantage on mental rotation. Results indicated that visual angle and exposure duration did not influence the effect size of global advantage on mental rotation. The global advantage on mental rotation and the perceptual global advantage seemed to be independent but their effects could be accumulated at some condition. These findings not only contribute to revealing a new processing property of mental rotation, but also deepen our understanding of the problem of global/local processing and shed light on the debate on locus of global precedence.
Resumo:
The researches of the CC's form processing mainly involved the effects of all kinds of form properties. In most of the cases, the researches were conducted after the lexical process completed. A few which was about early phases of visual perception focused on the process of feature extraction in character recognition. Up till now, nobody put forward a propose that we should study the form processing in the early phases of visual perception of CC. We hold that because the form processing occurs in the early phases of visual perception, we should study the processing prelexically. Moreover, visual perception of CC is a course during which the CC becomes clear gradually, so that the effects of all kinds of form properties should not be a absolute phenomena of an all-or-none. In this study we adopted 4 methods to research the form processing in the early phases simulatedly and systematically, including the tachistoscopic repetition, increasing time to present gradually, enlarging the visual angle gradually and non- tachistoscopic searching and naming. Under all kinds of bad or degraded visual conditions, the instantaneous course of early-phases processing was slowed down and postponed, and then the growth course was open to before our eyes. We can captured the characteristics of the form processing in the early phases by analyzing the reaction speed and recognition accuracy. Accompanying the visual angle and time increasing, the clarity improved and we can find out the relation between the effects of form properties and visual clarity improving. The results were as follows: ①in the early phases of visual perception of CC, there were the effects of all kinds of form properties. ②the quantity of the effects would cut down when the visual conditions were being changed better and better. We raised the concept of character's space transparency and it's algorithm to explain these effects of form properties. Furthermore, a model was discussed to help understand the phenomenon that the quantity of the effects changed as the visual conditions were improved. ③The early phases of visual perception of CC isn't the loci of the frequency effect.
Resumo:
Whether mice perceive the depth of space dependent on the visual size of object targets was explored when visual cues such as perspective and partial occlusion in space were excluded. A mouse was placed on a platform the height of which is adjustable. The platform located inside a box in which all other walls were dark exception its bottom through that light was projected as a sole visual cue. The visual object cue was composed of 4x4 grids to allow a mouse estimating the distance of the platform relative to the grids. Three sizes of grids reduced in a proportion of 2/3 and seven distances with an equal interval between the platform and the grids at the bottom were applied in the experiments. The duration of a mouse staying on the platform at each height was recorded when the different sizes of the grids were presented randomly to test whether the Judgment of the mouse for the depth of the platform from the bottom was affected by the size information of the visual target. The results from all conditions of three object sizes show that time of mice staying on the platform became longer with the increase in height. In distance of 20 similar to 30 cm, the mice did not use the size information of a target to judge the depth, while mainly used the information of binocular disparity. In distance less than 20 cm or more than 30 cm, however, especially in much higher distance 50 cm, 60 cm and 70 cm, the mice were able to use the size information to do so in order to compensate the lack of binocular disparity information from both eyes. Because the mice have only 1/3 of the visual field that is binocular. This behavioral paradigm established in the current study is a useful model and can be applied to the experiments using transgenic mouse as an animal model to investigate the relationships between behaviors and gene functions.
Resumo:
Both commercial and scientific applications often need to transform color images into gray-scale images, e. g., to reduce the publication cost in printing color images or to help color blind people see visual cues of color images. However, conventional color to gray algorithms are not ready for practical applications because they encounter the following problems: 1) Visual cues are not well defined so it is unclear how to preserve important cues in the transformed gray-scale images; 2) some algorithms have extremely high time cost for computation; and 3) some require human-computer interactions to have a reasonable transformation. To solve or at least reduce these problems, we propose a new algorithm based on a probabilistic graphical model with the assumption that the image is defined over a Markov random field. Thus, color to gray procedure can be regarded as a labeling process to preserve the newly well-defined visual cues of a color image in the transformed gray-scale image. Visual cues are measurements that can be extracted from a color image by a perceiver. They indicate the state of some properties of the image that the perceiver is interested in perceiving. Different people may perceive different cues from the same color image and three cues are defined in this paper, namely, color spatial consistency, image structure information, and color channel perception priority. We cast color to gray as a visual cue preservation procedure based on a probabilistic graphical model and optimize the model based on an integral minimization problem. We apply the new algorithm to both natural color images and artificial pictures, and demonstrate that the proposed approach outperforms representative conventional algorithms in terms of effectiveness and efficiency. In addition, it requires no human-computer interactions.