11 resultados para human visual masking
em Chinese Academy of Sciences Institutional Repositories Grid Portal
Resumo:
It is important for practical application to design an effective and efficient metric for video quality. The most reliable way is by subjective evaluation. Thus, to design an objective metric by simulating human visual system (HVS) is quite reasonable and available. In this paper, the video quality assessment metric based on visual perception is proposed. Three-dimensional wavelet is utilized to decompose video and then extract features to mimic the multichannel structure of HVS. Spatio-temporal contrast sensitivity function (S-T CSF) is employed to weight coefficient obtained by three-dimensional wavelet to simulate nonlinearity feature of the human eyes. Perceptual threshold is exploited to obtain visual sensitive coefficients after S-T CSF filtered. Visual sensitive coefficients are normalized representation and then visual sensitive errors are calculated between reference and distorted video. Finally, temporal perceptual mechanism is applied to count values of video quality for reducing computational cost. Experimental results prove the proposed method outperforms the most existing methods and is comparable to LHS and PVQM.
Resumo:
A novel spatiotemporal segmentation technique is further developed for extracting uncovered background and moving objects from the image sequences, then the following motion estimation is performed only on the regions corresponding to moving objects. The frame difference contrast (FCON) and local variance contrast (LCON), which are related to the temporal and spatial homogeneity of the image sequence, are selected to form the 2-D spatiotemporal entropy. Then the spatial segmentation threshold is determined by maximizing the 2-D spatiotemporal entropy, and the temporal segmentation point is selected to minimize the complexity measure for image sequence coding. Since both temporal and spatial correlation of an image sequence are exploited, this proposed spatiotemporal segmentation technique can further be used to determine the positions of reference frames adaptively, hence resulting in a low bit rate. Experimental results show that this segmentation-based coding scheme is more efficient than usual fixed-size coding algorithms. (C) 1997 Society of Photo-Optical Instrumentation Engineers.
Resumo:
Human visual function declines with age. Much of this decline is mediated by changes in the central visual pathways. In this study we compared the spatial and temporal sensitivities of striate cortical cells in young and old paralysed macaque monkeys. Ext
Resumo:
Inspired by human visual cognition mechanism, this paper first presents a scene classification method based on an improved standard model feature. Compared with state-of-the-art efforts in scene classification, the newly proposed method is more robust, more selective, and of lower complexity. These advantages are demonstrated by two sets of experiments on both our own database and standard public ones. Furthermore, occlusion and disorder problems in scene classification in video surveillance are also first studied in this paper.
Resumo:
Human cerebral cortical function degrades during old age. Much of this change may result from a degradation of intracortical inhibition during senescence. We used multibarreled microelectrodes to study the effects of electrophoretic application of gamma-aminobutyric acid (GABA), the GABA type a (GABAa) receptor agonist muscimol, and the GABAa receptor antagonist bicuculline, respectively, on the properties of individual V1 cells in old monkeys. Bicuculline exerted a much weaker effect on neuronal responses in old than in young animals, confirming a degradation of GABA-mediated inhibition. On the other hand, the administration of GABA and muscimol resulted in improved visual function. Many treated cells in area V1 of old animals displayed responses typical of young cells. The present results have important implications for the treatment of the sensory, motor, and cognitive declines that accompany old age.
Resumo:
Global information is considered the primitive of visual perception in Gestalt psychology. Further, L. Chen ( 2005) proposed a new theory of topological visual perception. According to this theory, the perception of topological difference is faster than o
Resumo:
The Yangtze River dolphin or baiji ( Lipotes vexillifer), an obligate freshwater odontocete known only from the middle-lower Yangtze River system and neighbouring Qiantang River in eastern China, has long been recognized as one of the world's rarest and most threatened mammal species. The status of the baiji has not been investigated since the late 1990s, when the surviving population was estimated to be as low as 13 individuals. An intensive six-week multivessel visual and acoustic survey carried out in November-December 2006, covering the entire historical range of the baiji in the main Yangtze channel, failed to find any evidence that the species survives. We are forced to conclude that the baiji is now likely to be extinct, probably due to unsustainable by-catch in local fisheries. This represents the first global extinction of a large vertebrate for over 50 years, only the fourth disappearance of an entire mammal family since AD 1500, and the first cetacean species to be driven to extinction by human activity. Immediate and extreme measures may be necessary to prevent the extinction of other endangered cetaceans, including the sympatric Yangtze finless porpoise ( Neophocaena phocaenoides asiaeorientalis).
Resumo:
Both commercial and scientific applications often need to transform color images into gray-scale images, e. g., to reduce the publication cost in printing color images or to help color blind people see visual cues of color images. However, conventional color to gray algorithms are not ready for practical applications because they encounter the following problems: 1) Visual cues are not well defined so it is unclear how to preserve important cues in the transformed gray-scale images; 2) some algorithms have extremely high time cost for computation; and 3) some require human-computer interactions to have a reasonable transformation. To solve or at least reduce these problems, we propose a new algorithm based on a probabilistic graphical model with the assumption that the image is defined over a Markov random field. Thus, color to gray procedure can be regarded as a labeling process to preserve the newly well-defined visual cues of a color image in the transformed gray-scale image. Visual cues are measurements that can be extracted from a color image by a perceiver. They indicate the state of some properties of the image that the perceiver is interested in perceiving. Different people may perceive different cues from the same color image and three cues are defined in this paper, namely, color spatial consistency, image structure information, and color channel perception priority. We cast color to gray as a visual cue preservation procedure based on a probabilistic graphical model and optimize the model based on an integral minimization problem. We apply the new algorithm to both natural color images and artificial pictures, and demonstrate that the proposed approach outperforms representative conventional algorithms in terms of effectiveness and efficiency. In addition, it requires no human-computer interactions.
Resumo:
Eye detection plays an important role in many practical applications. This paper presents a novel two-step scheme for eye detection. The first step models an eye by a newly defined visual-context pattern (VCP), and the second step applies semisupervised boosting for precise detection. VCP describes both the space and appearance relations between an eye region (region of eye) and a reference region (region of reference). The context feature of a VCP is extracted by using the integral image. Aiming to reduce the human labeling efforts, we apply semisupervised boosting, which integrates the context feature and the Haar-like features for precise eye detection. Experimental results on several standard face data sets demonstrate that the proposed approach is effective, robust, and efficient. We finally show that this approach is ready for practical applications.