983 results for auditory scene analysis


Relevance:

30.00%

Publisher:

Abstract:

Recently, vision-based advanced driver-assistance systems (ADAS) have attracted renewed interest as a means of enhancing driving safety. In particular, owing to their high performance–cost ratio, mono-camera systems are emerging as the main focus of this field of work. In this paper we present a novel on-board road modeling and vehicle detection system, developed as part of the European I-WAY project. The system relies on a robust estimation of the perspective of the scene, which adapts to the dynamics of the vehicle and generates a stabilized rectified image of the road plane. This rectified plane is used by a recursive Bayesian classifier, which classifies pixels as belonging to different classes corresponding to the elements of interest of the scenario. This stage works as an intermediate layer that isolates subsequent modules since it absorbs the inherent variability of the scene. The system has been tested on-road, in different scenarios, including varied illumination and adverse weather conditions, and the results have proved remarkable even for such complex scenarios.
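The recursive Bayesian classification mentioned in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the class names, the per-frame likelihood values, and the absence of a prediction (transition) step are all assumptions made for illustration. It only shows the core recursion, in which the posterior of one frame becomes the prior of the next.

```python
def bayes_update(prior, likelihood):
    """One recursive Bayesian step: posterior is proportional to likelihood * prior."""
    unnormalized = {c: likelihood[c] * prior[c] for c in prior}
    total = sum(unnormalized.values())
    return {c: v / total for c, v in unnormalized.items()}


# Hypothetical classes for one pixel of the rectified road image (not from the paper).
classes = ("pavement", "lane_marking", "object")

# Start from a uniform belief over the classes.
belief = {c: 1.0 / len(classes) for c in classes}

# Hypothetical per-frame likelihoods p(observation | class) for the same pixel.
frames = [
    {"pavement": 0.2, "lane_marking": 0.7, "object": 0.1},
    {"pavement": 0.3, "lane_marking": 0.6, "object": 0.1},
    {"pavement": 0.2, "lane_marking": 0.7, "object": 0.1},
]

# The posterior after each frame is reused as the prior for the next one.
for likelihood in frames:
    belief = bayes_update(belief, likelihood)

# The pixel is assigned to the class with the highest posterior probability.
label = max(belief, key=belief.get)
print(label, belief)
```

Reusing the posterior as the next prior is what lets evidence accumulate over successive frames instead of being recomputed from scratch for each image.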

Relevance:

30.00%

Publisher:

Abstract:

Thematic maps obtained by classifying remotely sensed images are only useful in applications if the resulting products are sufficiently accurate. Images registered from aircraft offer very satisfactory spatial resolution, but classical thematic classification methods do not always give better results than those obtained with satellite data. To improve these classification results, this work jointly uses first-return LIDAR (Light Detection And Ranging) data registered simultaneously with the airborne spectral sensor data. The final results of the thematic classification of the scene under study have been obtained, quantified and discussed with and without LIDAR data, after applying different methods: Maximum Likelihood classification, Support Vector Machine with four different kernel functions, and the Isodata clustering algorithm (ML, SVM-L, SVM-P, SVM-RBF, SVM-S, Isodata). The best results are obtained for SVM with the sigmoid kernel. These results allow correlation with other physical parameters of great interest, such as the Manning hydraulic coefficient, for incorporation into a GIS and application in hydraulic modeling.

Relevance:

30.00%

Publisher:

Abstract:

Quality assessment is a key factor for stereoscopic 3D video content as some observers are affected by visual discomfort in the eye when viewing 3D video, especially when combining positive and negative parallax with fast motion. In this paper, we propose techniques to assess objective quality related to motion and depth maps, which facilitate depth perception analysis. Subjective tests were carried out in order to understand the source of the problem. Motion is an important feature affecting 3D experience but also often the cause of visual discomfort. The automatic algorithm developed tries to quantify the impact on viewer experience when common cases of discomfort occur, such as high-motion sequences, scene changes with abrupt parallax changes, or complete absence of stereoscopy, with a goal of preventing the viewer from having a bad stereoscopic experience.

Relevance:

30.00%

Publisher:

Abstract:

In this paper, we present a depth-color scene modeling strategy for indoor 3D content generation. It combines depth and visual information provided by a low-cost active depth camera to improve the accuracy of the acquired depth maps, taking into account the different dynamic nature of the scene elements. Accurate depth and color models of the scene background are iteratively built and used to detect moving elements in the scene. The acquired depth data is continuously processed with an innovative joint-bilateral filter that efficiently combines depth and visual information thanks to the analysis of an edge-uncertainty map and the detected foreground regions. The main advantages of the proposed approach are: removing spatial noise and temporal random fluctuations from the depth maps; refining depth data at object boundaries; and iteratively generating a robust depth and color background model and an accurate moving-object silhouette.
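As a rough illustration of the joint-bilateral idea in the abstract above, the sketch below filters a one-dimensional depth profile using a color (guide) signal to decide which neighbors to trust. This is the plain, textbook joint bilateral filter, not the authors' variant: it omits the edge-uncertainty map and foreground regions, and the function name, parameter values and sample data are assumptions.

```python
import math


def joint_bilateral_1d(depth, guide, radius=2, sigma_s=1.5, sigma_r=10.0):
    """Smooth `depth` with weights that combine spatial closeness and
    similarity in the `guide` (e.g. color intensity) signal."""
    n = len(depth)
    out = []
    for i in range(n):
        num = 0.0
        den = 0.0
        for j in range(max(0, i - radius), min(n, i + radius + 1)):
            # Spatial weight: nearby samples count more.
            w_s = math.exp(-((i - j) ** 2) / (2 * sigma_s ** 2))
            # Range weight taken from the guide, not from the noisy depth itself.
            w_r = math.exp(-((guide[i] - guide[j]) ** 2) / (2 * sigma_r ** 2))
            w = w_s * w_r
            num += w * depth[j]
            den += w
        out.append(num / den)
    return out


# Hypothetical noisy depth profile with a step edge between index 3 and 4,
# and a guide signal (color intensity) with an edge at the same place.
depth = [1.0, 1.2, 0.9, 1.1, 3.0, 3.1, 2.9, 3.0]
guide = [50, 52, 49, 51, 200, 198, 201, 199]

smoothed = joint_bilateral_1d(depth, guide)
print([round(v, 3) for v in smoothed])
```

Because the range weight comes from the guide signal, samples on opposite sides of a color edge barely influence each other, so depth noise is averaged out within each region while the depth discontinuity at the object boundary is kept.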

Relevance:

30.00%

Publisher:

Abstract:

Perceptual voice evaluation according to the GRBAS scale is modelled using a linear combination of acoustic parameters calculated after a filter-bank analysis of the recorded voice signals. Modelling results indicate that for breathiness and asthenia more than 55% of the variance of perceptual rates can be explained by such a model, with only 4 latent variables. Moreover, the greatest part of the explained variance can be attributed to only one or two latent variables similarly weighted by all 5 listeners involved in the experiment. Correlation factors between actual rates and model predictions around 0.6 are obtained.

Relevance:

30.00%

Publisher:

Abstract:

Averaged event-related potential (ERP) data recorded from the human scalp reveal electroencephalographic (EEG) activity that is reliably time-locked and phase-locked to experimental events. We report here the application of a method based on information theory that decomposes one or more ERPs recorded at multiple scalp sensors into a sum of components with fixed scalp distributions and sparsely activated, maximally independent time courses. Independent component analysis (ICA) decomposes ERP data into a number of components equal to the number of sensors. The derived components have distinct but not necessarily orthogonal scalp projections. Unlike dipole-fitting methods, the algorithm does not model the locations of their generators in the head. Unlike methods that remove second-order correlations, such as principal component analysis (PCA), ICA also minimizes higher-order dependencies. Applied to detected—and undetected—target ERPs from an auditory vigilance experiment, the algorithm derived ten components that decomposed each of the major response peaks into one or more ICA components with relatively simple scalp distributions. Three of these components were active only when the subject detected the targets, three other components only when the target went undetected, and one in both cases. Three additional components accounted for the steady-state brain response to a 39-Hz background click train. Major features of the decomposition proved robust across sessions and changes in sensor number and placement. This method of ERP analysis can be used to compare responses from multiple stimuli, task conditions, and subject states.

Relevance:

30.00%

Publisher:

Abstract:

Hearing is one of the last sensory modalities to be subjected to genetic analysis in Drosophila melanogaster. We describe a behavioral assay for auditory function involving courtship among groups of males triggered by the pulse component of the courtship song. In a mutagenesis screen for mutations that disrupt the auditory response, we have recovered 15 mutations that either reduce or abolish this response. Mutant audiograms indicate that seven mutants reduced the amplitude of the response at all intensities. Another seven abolished the response altogether. The other mutant, 5L3, responded only at high sound intensities, indicating that the threshold was shifted in this mutant. Six mutants were characterized in greater detail. 5L3 had a general courtship defect; courtship of females by 5L3 males also was affected strongly. 5P1 males courted females normally but had reduced success at copulation. 5P1 and 5N18 showed a significant decrement in olfactory response, indicating that the defects in these mutations are not specific to the auditory pathway. Two other mutants, 5M8 and 5N30, produced amotile sperm although in 5N30 this phenotype was genetically separable from the auditory phenotype. Finally, a new adult circling behavior phenotype, the pirouette phenotype, associated with massive neurodegeneration in the brain, was discovered in two mutants, 5G10 and 5N18. This study provides the basis for a genetic and molecular dissection of auditory mechanosensation and auditory behavior.

Relevance:

30.00%

Publisher:

Abstract:

A set of ten RADARSAT-2 images acquired in fully polarimetric mode over a test site with rice fields in Seville, Spain, has been analyzed to extract the main features of the C-band radar backscatter as a function of rice phenology. After observing the evolutions versus phenology of different polarimetric observables and explaining their behavior in terms of scattering mechanisms present in the scene, a simple retrieval approach has been proposed. This algorithm is based on three polarimetric observables and provides estimates from a set of four relevant intervals of phenological stages. The validation against ground data, carried out at parcel level for a set of six stands and up to nine dates per stand, provides a 96% rate of coincidence. Moreover, an equivalent compact-pol retrieval algorithm has been also proposed and validated, providing the same performance at parcel level. In all cases, the inversion is carried out by exploiting a single satellite acquisition, without any other auxiliary information.

Relevance:

30.00%

Publisher:

Abstract:

"COO-2118-0028."

Relevance:

30.00%

Publisher:

Abstract:

Thesis (Ph.D.)--University of Washington, 2016-06

Relevance:

30.00%

Publisher:

Abstract:

Thesis (Master's)--University of Washington, 2016-06

Relevance:

30.00%

Publisher:

Abstract:

Remotely sensed data have been used extensively for environmental monitoring and modeling at a number of spatial scales; however, a limited range of satellite imaging systems often constrained the scales of these analyses. A wider variety of data sets is now available, allowing image data to be selected to match the scale of environmental structure(s) or process(es) being examined. A framework is presented for use by environmental scientists and managers, enabling their spatial data collection needs to be linked to a suitable form of remotely sensed data. A six-step approach is used, combining image spatial analysis and scaling tools, within the context of hierarchy theory. The main steps involved are: (1) identification of information requirements for the monitoring or management problem; (2) development of ideal image dimensions (scene model); (3) exploratory analysis of existing remotely sensed data using scaling techniques; (4) selection and evaluation of suitable remotely sensed data based on the scene model; (5) selection of suitable spatial analytic techniques to meet information requirements; and (6) cost-benefit analysis. Results from a case study show that the framework provided an objective mechanism to identify relevant aspects of the monitoring problem and environmental characteristics for selecting remotely sensed data and analysis techniques.

Relevance:

30.00%

Publisher:

Abstract:

Objective: To examine the relationship between the auditory brain-stem response (ABR) and its reconstructed waveforms following discrete wavelet transformation (DWT), and to comment on the resulting implications for ABR DWT time-frequency analysis. Methods: ABR waveforms were recorded from 120 normal hearing subjects at 90, 70, 50, 30, 10 and 0 dBnHL, decomposed using a 6 level discrete wavelet transformation (DWT), and reconstructed at individual wavelet scales (frequency ranges) A6, D6, D5 and D4. These waveforms were then compared for general correlations, and for patterns of change due to stimulus level, and subject age, gender and test ear. Results: The reconstructed ABR DWT waveforms showed 3 primary components: a large-amplitude waveform in the low-frequency A6 scale (0-266.6 Hz) with its single peak corresponding in latency with ABR waves III and V; a mid-amplitude waveform in the mid-frequency D6 scale (266.6-533.3 Hz) with its first 5 waves corresponding in latency to ABR waves I, III, V, VI and VII; and a small-amplitude, multiple-peaked waveform in the high-frequency D5 scale (533.3-1066.6 Hz) with its first 7 waves corresponding in latency to ABR waves I, II, III, IV, V, VI and VII. Comparisons between ABR waves I, III and V and their corresponding reconstructed ABR DWT waves showed strong correlations and similar, reliable, and statistically robust changes due to stimulus level and subject age, gender and test ear groupings. Limiting these findings, however, was the unexplained absence of a small number (2%, or 117/6720) of reconstructed ABR DWT waves, despite their corresponding ABR waves being present. Conclusions: Reconstructed ABR DWT waveforms can be used as valid time-frequency representations of the normal ABR, but with some limitations. In particular, the unexplained absence of a small number of reconstructed ABR DWT waves in some subjects, probably resulting from 'shift invariance' inherent to the DWT process, needs to be addressed.
Significance: This is the first report of the relationship between the ABR and its reconstructed ABR DWT waveforms in a large normative sample. (C) 2004 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.

Relevance:

30.00%

Publisher:

Abstract:

Objective: To use the over-complete discrete wavelet transform (OCDWT) to further examine the dual structure of auditory brainstem response (ABR) in the dog. Methods: ABR waveforms recorded from 20 adult dogs at supra-threshold (90 and 70 dBnHL) and threshold (0-15 dBSL) levels were decomposed using a six level OCDWT and reconstructed at individual scales (frequency ranges) A6 (0-391 Hz), D6 (391-781 Hz), and D5 (781-1563 Hz). Results: At supra-threshold stimulus levels, the A6 scale (0-391 Hz) showed a large amplitude waveform with its prominent wave corresponding in latency with ABR waves II/III; the D6 scale (391-781 Hz) showed a small amplitude waveform with its first four waves corresponding in latency to ABR waves I, II/III, V, and VI; and the D5 scale (781-1563 Hz) showed a large amplitude, multiple peaked waveform with its first six waves corresponding in latency to ABR waves I, II, III, IV, V, and VI. At threshold stimulus levels (0-15 dBSL), the A6 scale (0-391 Hz) continued to show a relatively large amplitude waveform, but both the D6 and D5 scales (391-781 and 781-1563 Hz, respectively) now showed relatively small amplitude waveforms. Conclusions: A dual structure exists within the ABR of the dog, but its relative structure changes with stimulus level. Significance: The ABR in the dog differs from that in the human both in the relative contributions made by its different frequency components, and the way these components change with stimulus level. (c) 2006 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
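The scale labels in the abstract above (A6, D6, D5) map to frequency ranges through the usual dyadic splitting of a wavelet decomposition: each level halves the remaining band. The sketch below computes these nominal bands. The 50 kHz sampling rate is an assumption, not stated in the abstract; it is used only because it reproduces the quoted ranges, and the function name is hypothetical.

```python
def dwt_bands(fs_hz, levels):
    """Nominal frequency band (low, high) in Hz of each detail scale D1..D<levels>
    and of the final approximation scale A<levels>, for sampling rate fs_hz."""
    nyquist = fs_hz / 2.0
    bands = {}
    for k in range(1, levels + 1):
        # Detail scale k covers the upper half of the band left after k-1 splits.
        bands[f"D{k}"] = (nyquist / 2 ** k, nyquist / 2 ** (k - 1))
    # The approximation scale keeps everything below the last split.
    bands[f"A{levels}"] = (0.0, nyquist / 2 ** levels)
    return bands


# Assumed sampling rate of 50 kHz (inferred from the quoted bands, not from the source).
bands = dwt_bands(50_000, 6)
for name in ("A6", "D6", "D5"):
    low, high = bands[name]
    print(f"{name}: {low:.1f}-{high:.1f} Hz")
```

Under this assumption the A6, D6 and D5 edges fall at 390.625, 781.25 and 1562.5 Hz, which round to the 391, 781 and 1563 Hz quoted in the abstract. These are nominal bands only; real wavelet filters overlap at the band edges.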

Relevance:

30.00%

Publisher:

Abstract:

This paper describes the development of an instrument to assess coping strategies for auditory hallucinations. An inventory of coping strategies was obtained by conducting semi-structured interviews with 17 male participants. This inventory was then used to develop a 27-item questionnaire, the Responses to Auditory Hallucinations Questionnaire (RAHQ). The RAHQ was administered to 125 respondents. Measures of symptom severity, appraisal, anxiety, depression and coping dissatisfaction were also administered. Factor analysis of the RAHQ yielded three coping subscales: Active coping, Passive coping and Suppression coping. The subscales were shown to be empirically distinct and to possess satisfactory internal reliability. For a small subgroup of participants, two of the three subscales demonstrated satisfactory test-retest reliability. Construct validity was assessed within a stress and coping framework. The RAHQ will facilitate the investigation of the efficacy of coping strategies for the management of auditory hallucinations.