940 resultados para SPECTRAL CUES
Resumo:
Noise-vocoded (NV) speech is often regarded as conveying phonetic information primarily through temporal-envelope cues rather than spectral cues. However, listeners may infer the formant frequencies in the vocal-tract output—a key source of phonetic detail—from across-band differences in amplitude when speech is processed through a small number of channels. The potential utility of this spectral information was assessed for NV speech created by filtering sentences into six frequency bands, and using the amplitude envelope of each band (=30 Hz) to modulate a matched noise-band carrier (N). Bands were paired, corresponding to F1 (˜N1 + N2), F2 (˜N3 + N4) and the higher formants (F3' ˜ N5 + N6), such that the frequency contour of each formant was implied by variations in relative amplitude between bands within the corresponding pair. Three-formant analogues (F0 = 150 Hz) of the NV stimuli were synthesized using frame-by-frame reconstruction of the frequency and amplitude of each formant. These analogues were less intelligible than the NV stimuli or analogues created using contours extracted from spectrograms of the original sentences, but more intelligible than when the frequency contours were replaced with constant (mean) values. Across-band comparisons of amplitude envelopes in NV speech can provide phonetically important information about the frequency contours of the underlying formants.
Resumo:
Dans le cas de perte auditive, la localisation spatiale est amoindrie et vient entraver la compréhension de la parole et ce, malgré le port de prothèses auditives. La présente étude modifie la forme de l’oreille externe d’individus à l’aide de silicone afin d’induire des changements aux indices spectraux (HRTFs), similaires à ceux causés par des prothèses auditives, et d’explorer les mécanismes perceptifs (visuel, spectral, ou tactile) permettant d’alterner d’un nouvel ensemble à l’ensemble originel de HRTFs une fois les prothèses enlevées. Les résultats démontrent que les participants s’adaptent aux nouveaux HRTFs à l’intérieur de quatre séances d’entraînement. Dès le retrait des prothèses, les participants reviennent à leur performance originale. Il n’est pas possible de conclure avec les données présentes si le changement d’un ensemble de HRTFs à un autre est influencé par un des mécanismes de rétroaction perceptuelle étudié. L’adaptation aux prothèses perdure jusqu’à quatre semaines après leur retrait.
Resumo:
Leao RM, Li S, Doiron B, Tzounopoulos T. Diverse levels of an inwardly rectifying potassium conductance generate heterogeneous neuronal behavior in a population of dorsal cochlear nucleus pyramidal neurons. J Neurophysiol 107: 3008-3019, 2012. First published February 29, 2012; doi:10.1152/jn.00660.2011.-Homeostatic mechanisms maintain homogeneous neuronal behavior among neurons that exhibit substantial variability in the expression levels of their ionic conductances. In contrast, the mechanisms, which generate heterogeneous neuronal behavior across a neuronal population, remain poorly understood. We addressed this problem in the dorsal cochlear nucleus, where principal neurons exist in two qualitatively distinct states: spontaneously active or not spontaneously active. Our studies reveal that distinct activity states are generated by the differential levels of a Ba2+-sensitive, inwardly rectifying potassium conductance (K-ir). Variability in K-ir maximal conductance causes variations in the resting membrane potential (RMP). Low K-ir conductance depolarizes RMP to voltages above the threshold for activating subthreshold-persistent sodium channels (Na-p). Once Na-p channels are activated, the RMP becomes unstable, and spontaneous firing is triggered. Our results provide a biophysical mechanism for generating neural heterogeneity, which may play a role in the encoding of sensory information.
Resumo:
Echolocating big brown bats (Eptesicus fuscus) broadcast ultrasonic frequency-modulated (FM) biosonar sounds (20–100 kHz frequencies; 10–50 μs periods) and perceive target range from echo delay. Knowing the acuity for delay resolution is essential to understand how bats process echoes because they perceive target shape and texture from the delay separation of multiple reflections. Bats can separately perceive the delays of two concurrent electronically generated echoes arriving as little as 2 μs apart, thus resolving reflecting points as close together as 0.3 mm in range (two-point threshold). This two-point resolution is roughly five times smaller than the shortest periods in the bat’s sounds. Because the bat’s broadcasts are 2,000–4,500 μs long, the echoes themselves overlap and interfere with each other, to merge together into a single sound whose spectrum is shaped by their mutual interference depending on the size of the time separation. To separately perceive the delays of overlapping echoes, the bat has to recover information about their very small delay separation that was transferred into the spectrum when the two echoes interfered with each other, thus explicitly reconstructing the range profile of targets from the echo spectrum. However, the bat’s 2-μs resolution limit is so short that the available spectral cues are extremely limited. Resolution of delay seems overly sharp just for interception of flying insects, which suggests that the bat’s biosonar images are of higher quality to suit a wider variety of orientation tasks, and that biosonar echo processing is correspondingly more sophisticated than has been suspected.
Resumo:
Sound localization relies on the neural processing of monaural and binaural spatial cues that arise from the way sounds interact with the head and external ears. Neurophysiological studies of animals raised with abnormal sensory inputs show that the map of auditory space in the superior colliculus is shaped during development by both auditory and visual experience. An example of this plasticity is provided by monaural occlusion during infancy, which leads to compensatory changes in auditory spatial tuning that tend to preserve the alignment between the neural representations of visual and auditory space. Adaptive changes also take place in sound localization behavior, as demonstrated by the fact that ferrets raised and tested with one ear plugged learn to localize as accurately as control animals. In both cases, these adjustments may involve greater use of monaural spectral cues provided by the other ear. Although plasticity in the auditory space map seems to be restricted to development, adult ferrets show some recovery of sound localization behavior after long-term monaural occlusion. The capacity for behavioral adaptation is, however, task dependent, because auditory spatial acuity and binaural unmasking (a measure of the spatial contribution to the “cocktail party effect”) are permanently impaired by chronically plugging one ear, both in infancy but especially in adulthood. Experience-induced plasticity allows the neural circuitry underlying sound localization to be customized to individual characteristics, such as the size and shape of the head and ears, and to compensate for natural conductive hearing losses, including those associated with middle ear disease in infancy.
Resumo:
One of the most popular techniques for creating spatialized virtual sounds is based on the use of Head-Related Transfer Functions (HRTFs). HRTFs are signal processing models that represent the modifications undergone by the acoustic signal as it travels from a sound source to each of the listener's eardrums. These modifications are due to the interaction of the acoustic waves with the listener's torso, shoulders, head and pinnae, or outer ears. As such, HRTFs are somewhat different for each listener. For a listener to perceive synthesized 3-D sound cues correctly, the synthesized cues must be similar to the listener's own HRTFs. ^ One can measure individual HRTFs using specialized recording systems, however, these systems are prohibitively expensive and restrict the portability of the 3-D sound system. HRTF-based systems also face several computational challenges. This dissertation presents an alternative method for the synthesis of binaural spatialized sounds. The sound entering the pinna undergoes several reflective, diffractive and resonant phenomena, which determine the HRTF. Using signal processing tools, such as Prony's signal modeling method, an appropriate set of time delays and a resonant frequency were used to approximate the measured Head-Related Impulse Responses (HRIRs). Statistical analysis was used to find out empirical equations describing how the reflections and resonances are determined by the shape and size of the pinna features obtained from 3D images of 15 experimental subjects modeled in the project. These equations were used to yield “Model HRTFs” that can create elevation effects. ^ Listening tests conducted on 10 subjects show that these model HRTFs are 5% more effective than generic HRTFs when it comes to localizing sounds in the frontal plane. The number of reversals (perception of sound source above the horizontal plane when actually it is below the plane and vice versa) was also reduced by 5.7%, showing the perceptual effectiveness of this approach. The model is simple, yet versatile because it relies on easy to measure parameters to create an individualized HRTF. This low-order parameterized model also reduces the computational and storage demands, while maintaining a sufficient number of perceptually relevant spectral cues. ^
Resumo:
Listeners can attend to one of several simultaneous messages by tracking one speaker’s voice characteristics. Using differences in the location of sounds in a room, we ask how well cues arising from spatial position compete with these characteristics. Listeners decided which of two simultaneous target words belonged in an attended “context” phrase when it was played simultaneously with a different “distracter” context. Talker difference was in competition with position difference, so the response indicates which cue‐type the listener was tracking. Spatial position was found to override talker difference in dichotic conditions when the talkers are similar (male). The salience of cues associated with differences in sounds, bearings decreased with distance between listener and sources. These cues are more effective binaurally. However, there appear to be other cues that increase in salience with distance between sounds. This increase is more prominent in diotic conditions, indicating that these cues are largely monaural. Distances between spectra calculated using a gammatone filterbank (with ERB‐spaced CFs) of the room’s impulse responses at different locations were computed, and comparison with listeners’ responses suggested some slight monaural loudness cues, but also monaural “timbre” cues arising from the temporal‐ and spectral‐envelope differences in the speech from different locations.
Resumo:
Purpose. To investigate misalignments (MAs) on retinal nerve fiber layer thickness (RNFLT) measurements obtained with Cirrus(©) SD-OCT. Methods. This was a retrospective, observational, cross-sectional study. Twenty-seven healthy and 29 glaucomatous eyes of 56 individuals with one normal exam and another showing MA were included. MAs were defined as an improper alignment of vertical vessels in the en face image. MAs were classified in complete MA (CMA) and partial MA (PMA), according to their site: 1 (superior, outside the measurement ring (MR)), 2 (superior, within MR), 3 (inferior, within MR), and 4 (inferior, outside MR). We compared RNFLT measurements of aligned versus misaligned exams in all 4 sectors, in the superior area (sectors 1 + 2), inferior area (sectors 3 + 4), and within the measurement ring (sectors 2 + 3). Results. RNFLT measurements at 12 clock-hour of eyes with MAs in the superior area (sectors 1 + 2) were significantly lower than those obtained in the same eyes without MAs (P = 0.043). No significant difference was found in other areas (sectors 1 + 2 + 3 + 4, sectors 3 + 4, and sectors 2 + 3). Conclusion. SD-OCT scans with superior MAs may present lower superior RNFLT measurements compared to aligned exams.
Resumo:
PURPOSE: To evaluate the sensitivity and specificity of machine learning classifiers (MLCs) for glaucoma diagnosis using Spectral Domain OCT (SD-OCT) and standard automated perimetry (SAP). METHODS: Observational cross-sectional study. Sixty two glaucoma patients and 48 healthy individuals were included. All patients underwent a complete ophthalmologic examination, achromatic standard automated perimetry (SAP) and retinal nerve fiber layer (RNFL) imaging with SD-OCT (Cirrus HD-OCT; Carl Zeiss Meditec Inc., Dublin, California). Receiver operating characteristic (ROC) curves were obtained for all SD-OCT parameters and global indices of SAP. Subsequently, the following MLCs were tested using parameters from the SD-OCT and SAP: Bagging (BAG), Naive-Bayes (NB), Multilayer Perceptron (MLP), Radial Basis Function (RBF), Random Forest (RAN), Ensemble Selection (ENS), Classification Tree (CTREE), Ada Boost M1(ADA),Support Vector Machine Linear (SVML) and Support Vector Machine Gaussian (SVMG). Areas under the receiver operating characteristic curves (aROC) obtained for isolated SAP and OCT parameters were compared with MLCs using OCT+SAP data. RESULTS: Combining OCT and SAP data, MLCs' aROCs varied from 0.777(CTREE) to 0.946 (RAN).The best OCT+SAP aROC obtained with RAN (0.946) was significantly larger the best single OCT parameter (p<0.05), but was not significantly different from the aROC obtained with the best single SAP parameter (p=0.19). CONCLUSION: Machine learning classifiers trained on OCT and SAP data can successfully discriminate between healthy and glaucomatous eyes. The combination of OCT and SAP measurements improved the diagnostic accuracy compared with OCT data alone.
Resumo:
The study of tokamak plasma light emissions in the vacuum ultraviolet (VUV) region is an important subject since many impurity spectral emissions are present in this region. These spectral emissions can be used to determine the plasma ion temperature and density from different species and spatial positions inside plasma according to their temperatures. We have analyzed VUV spectra from 500 Å to 3200 Å wavelength in the TCABR tokamak plasma including higher diffraction order emissions. There have been identified 37 first diffraction order emissions, resulting in 28 second diffraction order, 24 third diffraction order, and 7 fourth diffraction order lines. The emissions are from impurity species such as OII, OIII, OIV, OV, OVI, OVII, CII, CIII, CIV, NIII, NIV, and NV. All the spectra beyond 1900 Å are from higher diffraction order emissions, and possess much better spectral resolution. Each strong and isolated spectral line, as well as its higher diffraction order emissions suitable for plasma diagnostic is identified and discussed. Finally, an example of ion temperature determination using different diffraction order is presented.
Resumo:
Objective: The biochemical alterations between inflammatory fibrous hyperplasia (IFH) and normal tissues of buccal mucosa were probed by using the FT-Raman spectroscopy technique. The aim was to find the minimal set of Raman bands that would furnish the best discrimination. Background: Raman-based optical biopsy is a widely recognized potential technique for noninvasive real-time diagnosis. However, few studies had been devoted to the discrimination of very common subtle or early pathologic states as inflammatory processes that are always present on, for example, cancer lesion borders. Methods: Seventy spectra of IFH from 14 patients were compared with 30 spectra of normal tissues from six patients. The statistical analysis was performed with principal components analysis and soft independent modeling class analogy cross-validated, leave-one-out methods. Results: Bands close to 574, 1,100, 1,250 to 1,350, and 1,500 cm(-1) (mainly amino acids and collagen bands) showed the main intragroup variations that are due to the acanthosis process in the IFH epithelium. The 1,200 (C-C aromatic/DNA), 1,350 (CH(2) bending/collagen 1), and 1,730 cm(-1) (collagen III) regions presented the main intergroup variations. This finding was interpreted as originating in an extracellular matrix-degeneration process occurring in the inflammatory tissues. The statistical analysis results indicated that the best discrimination capability (sensitivity of 95% and specificity of 100%) was found by using the 530-580 cm(-1) spectral region. Conclusions: The existence of this narrow spectral window enabling normal and inflammatory diagnosis also had useful implications for an in vivo dispersive Raman setup for clinical applications.
Resumo:
Multifilter rotating shadowband radiometer (MFRSR) calibration values for aerosol optical depth (AOD) retrievals were determined by means of the general method formulated by Forgan [Appl. Opt. 33, 4841 (1994)] at a polluted urban site. The obtained precision is comparable with the classical method, the Langley plot, applied on clean mountaintops distant of pollution sources. The AOD retrieved over Sao Paulo City with both calibration procedures is compared with the Aerosol Robotic Network data. The observed results are similar, and, except for the shortest wavelength (415 nm), the MFRSR`s AOD is systematically overestimated by similar to 0.03. (c) 2008 Optical Society of America.
Resumo:
Synoptic spectroscopic observations of the U Sco 2010 outburst from maximum light to quiescence as well as a contemporaneous X-ray observation are presented and analyzed. The X-ray spectrum 52 days after outburst indicates a hot source ( kT(bb) similar to 70 eV). Narrow-line components from the irradiated companion atmosphere were observed in hydrogen and helium optical recombination lines. The formation of a nebular spectrum is seen for the first time in this class of recurrent novae, allowing a detailed study of the ejecta using photoionization models. Unusual [O III] auroral-to-nebular line ratios were found and possible scenarios of their origin are discussed. The modeling of the emission line spectrum suggests highly heterogeneous ejecta with masses around or above 3 x 10(-6) M(sun).
Resumo:
The HR Del nova remnant was observed with the IFU-GMOS at Gemini North. The spatially resolved spectral data cube was used in the kinematic, morphological, and abundance analysis of the ejecta. The line maps show a very clumpy shell with two main symmetric structures. The first one is the outer part of the shell seen in H alpha, which forms two rings projected in the sky plane. These ring structures correspond to a closed hourglass shape, first proposed by Harman & O'Brien. The equatorial emission enhancement is caused by the superimposed hourglass structures in the line of sight. The second structure seen only in the [O III] and [N II] maps is located along the polar directions inside the hourglass structure. Abundance gradients between the polar caps and equatorial region were not found. However, the outer part of the shell seems to be less abundant in oxygen and nitrogen than the inner regions. Detailed 2.5-dimensional photoionization modeling of the three-dimensional shell was performed using the mass distribution inferred from the observations and the presence of mass clumps. The resulting model grids are used to constrain the physical properties of the shell as well as the central ionizing source. A sequence of three-dimensional clumpy models including a disk-shaped ionization source is able to reproduce the ionization gradients between polar and equatorial regions of the shell. Differences between shell axial ratios in different lines can also be explained by aspherical illumination. A total shell mass of 9 x 10(-4) M(circle dot) is derived from these models. We estimate that 50%-70% of the shell mass is contained in neutral clumps with density contrast up to a factor of 30.
Resumo:
Context. Tight binaries discovered in young, nearby associations are ideal targets for providing dynamical mass measurements to test the physics of evolutionary models at young ages and very low masses. Aims. We report the binarity of TWA22 for the first time. We aim at monitoring the orbit of this young and tight system to determine its total dynamical mass using an accurate distance determination. We also intend to characterize the physical properties (luminosity, effective temperature, and surface gravity) of each component based on near-infrared photometric and spectroscopic observations. Methods. We used the adaptive-optics assisted imager NACO to resolve the components, to monitor the complete orbit and to obtain the relative near-infrared photometry of TWA22 AB. The adaptive-optics assisted integral field spectrometer SINFONI was also used to obtain medium-resolution (R(lambda) = 1500-2000) spectra in JHK bands. Comparison with empirical and synthetic librairies were necessary for deriving the spectral type, the effective temperature, and the surface gravity for each component of the system. Results. Based on an accurate trigonometric distance (17.5 +/- 0.2 pc) determination, we infer a total dynamical mass of 220 +/- 21 M(Jup) for the system. From the complete set of spectra, we find an effective temperature T(eff) = 2900(-200)(+200) K for TWA22A and T(eff) = 2900(-100)(+200) for TWA22 B and surface gravities between 4.0 and 5.5 dex. From our photometry and an M6 +/- 1 spectral type for both components, we find luminosities of log(L/L(circle dot)) = -2.11 +/- 0.13 dex and log(L/L(circle dot)) = -2.30 +/- 0.16 dex for TWA22 A and B, respectively. By comparing these parameters with evolutionary models, we question the age and the multiplicity of this system. We also discuss a possible underestimation of the mass predicted by evolutionary models for young stars close to the substellar boundary.