16 resultados para auditory
em Indian Institute of Science - Bangalore - Índia
Resumo:
We address the problem of estimating the fundamental frequency of voiced speech. We present a novel solution motivated by the importance of amplitude modulation in sound processing and speech perception. The new algorithm is based on a cumulative spectrum computed from the temporal envelope of various subbands. We provide theoretical analysis to derive the new pitch estimator based on the temporal envelope of the bandpass speech signal. We report extensive experimental performance for synthetic as well as natural vowels for both realworld noisy and noise-free data. Experimental results show that the new technique performs accurate pitch estimation and is robust to noise. We also show that the technique is superior to the autocorrelation technique for pitch estimation.
Resumo:
The ability of the continuous wavelet transform (CWT) to provide good time and frequency localization has made it a popular tool in time-frequency analysis of signals. Wavelets exhibit constant-Q property, which is also possessed by the basilar membrane filters in the peripheral auditory system. The basilar membrane filters or auditory filters are often modeled by a Gammatone function, which provides a good approximation to experimentally determined responses. The filterbank derived from these filters is referred to as a Gammatone filterbank. In general, wavelet analysis can be likened to a filterbank analysis and hence the interesting link between standard wavelet analysis and Gammatone filterbank. However, the Gammatone function does not exactly qualify as a wavelet because its time average is not zero. We show how bona fide wavelets can be constructed out of Gammatone functions. We analyze properties such as admissibility, time-bandwidth product, vanishing moments, which are particularly relevant in the context of wavelets. We also show how the proposed auditory wavelets are produced as the impulse response of a linear, shift-invariant system governed by a linear differential equation with constant coefficients. We propose analog circuit implementations of the proposed CWT. We also show how the Gammatone-derived wavelets can be used for singularity detection and time-frequency analysis of transient signals. (C) 2013 Elsevier B.V. All rights reserved.
Resumo:
Non-stationary signal modeling is a well addressed problem in the literature. Many methods have been proposed to model non-stationary signals such as time varying linear prediction and AM-FM modeling, the later being more popular. Estimation techniques to determine the AM-FM components of narrow-band signal, such as Hilbert transform, DESA1, DESA2, auditory processing approach, ZC approach, etc., are prevalent but their robustness to noise is not clearly addressed in the literature. This is critical for most practical applications, such as in communications. We explore the robustness of different AM-FM estimators in the presence of white Gaussian noise. Also, we have proposed three new methods for IF estimation based on non-uniform samples of the signal and multi-resolution analysis. Experimental results show that ZC based methods give better results than the popular methods such as DESA in clean condition as well as noisy condition.
Resumo:
We present a generic theory for the dynamics of a stiff filament under tension, in an active medium with orientational correlations, such as a microtubule in contractile actin. In sharp contrast to the case of a passive medium, we find the filament can stiffen, and possibly oscillate or buckle, depending on both the contractile or tensile nature of the activity and the filament-medium anchoring interaction. We also demonstrate a strong violation of the fluctuation-dissipation (FD) relation in the effective dynamics of the filament, including a negative FD ratio. Our approach is also of relevance to the dynamics of axons, and our model equations bear a remarkable formal similarity to those in recent work [Martin P, Hudspeth AJ, Juelicher F (2001) Proc Natl Acad Sci USA 98: 14380-14385] on auditory hair cells. Detailed tests of our predictions can be made by using a single filament in actomyosin extracts or bacterial suspensions.
Resumo:
Individuals in distress emit audible vocalizations to either warn or inform conspecifics. The Indian short-nosed fruit bat, Cynopterus sphinx, emits distress calls soon after becoming entangled in mist nets, which appear to attract conspecifics. Phase I of these distress calls is longer and louder, and includes a secondary peak, compared to phase II. Activity-dependent expression of egr-1 was examined in free-ranging C. sphinx following the emissions and responses to a distress call. We found that the level of expression of egr-1 was higher in bats that emitted a distress call, in adults that responded, and in pups than in silent bats. Up-regulated cDNA was amplified to identify the target gene (TOE1) of the protein Egr-1. The observed expression pattern Toe1 was similar to that of egr-1. These findings suggest that the neuronal activity related to recognition of a distress call and an auditory feedback mechanism induces the expression of Egr-1. Co-expression of egr-1 with Toe1 may play a role in initial triggering of the genetic mechanism that could be involved in the consolidation or stabilization of distress call memories.
Resumo:
Crickets have two tympanal membranes on the tibiae of each foreleg. Among several field cricket species of the genus Gryllus (Gryllinae), the posterior tympanal membrane (PTM) is significantly larger than the anterior membrane (ATM). Laser Doppler vibrometric measurements have shown that the smaller ATM does not respond as much as the PTM to sound. Hence the PTM has been suggested to be the principal tympanal acoustic input to the auditory organ. In tree crickets (Oecanthinae), the ATM is slightly larger than the PTM. Both membranes are structurally complex, presenting a series of transverse folds on their surface, which are more pronounced on the ATM than on the PTM. The mechanical response of both membranes to acoustic stimulation was investigated using microscanning laser Doppler vibrometry. Only a small portion of the membrane surface deflects in response to sound. Both membranes exhibit similar frequency responses, and move out of phase with each other, producing compressions and rarefactions of the tracheal volume backing the tympanum. Therefore, unlike field crickets, tree crickets may have four instead of two functional tympanal membranes. This is interesting in the context of the outstanding question of the role of spiracular inputs in the auditory system of tree crickets.
Resumo:
Synchronising bushcricket males achieve synchrony by delaying their chirps in response to calling neighbours. In multi-male choruses, males that delay chirps in response to all their neighbours would remain silent most of the time and be unable to attract mates. This problem could be overcome if the afferent auditory system exhibited selective attention, and thus a male interacted only with a subset of neighbours. We investigated whether individuals of the bushcricket genus Mecopoda restricted their attention to louder chirps neurophysiologically, behaviourally and through spacing. We found that louder leading chirps were preferentially represented in the omega neuron but the representation of softer following chirps was not completely abolished. Following chirps that were 20 dB louder than leading chirps were better represented than leading chirps. During acoustic interactions, males synchronised with leading chirps even when the following chirps were 20 dB louder. Males did not restrict their attention to louder chirps during interactions but were affected by all chirps above a particular threshold. In the field, we found that males on average had only one or two neighbours whose calls were above this threshold. Selective attention is thus achieved in this bushcricket through spacing rather than neurophysiological filtering of softer signals.
Resumo:
Synchronising bushcricket males achieve synchrony by delaying their chirps in response to calling neighbours. In multi-male choruses, males that delay chirps in response to all their neighbours would remain silent most of the time and be unable to attract mates. This problem could be overcome if the afferent auditory system exhibited selective attention, and thus a male interacted only with a subset of neighbours. We investigated whether individuals of the bushcricket genus Mecopoda restricted their attention to louder chirps neurophysiologically, behaviourally and through spacing. We found that louder leading chirps were preferentially represented in the omega neuron but the representation of softer following chirps was not completely abolished. Following chirps that were 20 dB louder than leading chirps were better represented than leading chirps. During acoustic interactions, males synchronised with leading chirps even when the following chirps were 20 dB louder. Males did not restrict their attention to louder chirps during interactions but were affected by all chirps above a particular threshold. In the field, we found that males on average had only one or two neighbours whose calls were above this threshold. Selective attention is thus achieved in this bushcricket through spacing rather than neurophysiological filtering of softer signals.
Resumo:
Synchronising bushcricket males achieve synchrony by delaying their chirps in response to calling neighbours. In multi-male choruses, males that delay chirps in response to all their neighbours would remain silent most of the time and be unable to attract mates. This problem could be overcome if the afferent auditory system exhibited selective attention, and thus a male interacted only with a subset of neighbours. We investigated whether individuals of the bushcricket genus Mecopoda restricted their attention to louder chirps neurophysiologically, behaviourally and through spacing. We found that louder leading chirps were preferentially represented in the omega neuron but the representation of softer following chirps was not completely abolished. Following chirps that were 20 dB louder than leading chirps were better represented than leading chirps. During acoustic interactions, males synchronised with leading chirps even when the following chirps were 20 dB louder. Males did not restrict their attention to louder chirps during interactions but were affected by all chirps above a particular threshold. In the field, we found that males on average had only one or two neighbours whose calls were above this threshold. Selective attention is thus achieved in this bushcricket through spacing rather than neurophysiological filtering of softer signals.
Resumo:
Animals communicate in non-ideal and noisy conditions. The primary method they use to improve communication efficiency is sender-receiver matching: the receiver's sensory mechanism filters the impinging signal based on the expected signal. In the context of acoustic communication in crickets, such a match is made in the frequency domain. The males broadcast a mate attraction signal, the calling song, in a narrow frequency band centred on the carrier frequency (CF), and the females are most sensitive to sound close to this frequency. In tree crickets, however, the CF changes with temperature. The mechanisms used by female tree crickets to accommodate this change in CF were investigated at the behavioural and biomechanical level. At the behavioural level, female tree crickets were broadly tuned and responded equally to CFs produced within the naturally occurring range of temperatures (18 to 27 degrees C). To allow such a broad response, however, the transduction mechanisms that convert sound into mechanical and then neural signals must also have a broad response. The tympana of the female tree crickets exhibited a frequency response that was even broader than suggested by the behaviour. Their tympana vibrate with equal amplitude to frequencies spanning nearly an order of magnitude. Such a flat frequency response is unusual in biological systems and cannot be modelled as a simple mechanical system. This feature of the tree cricket auditory system not only has interesting implications for mate choice and species isolation but may also prove exciting for bio-mimetic applications such as the design of miniature low frequency microphones.
Resumo:
Background & objectives: There is a need to develop an affordable and reliable tool for hearing screening of neonates in resource constrained, medically underserved areas of developing nations. This study valuates a strategy of health worker based screening of neonates using a low cost mechanical calibrated noisemaker followed up with parental monitoring of age appropriate auditory milestones for detecting severe-profound hearing impairment in infants by 6 months of age. Methods: A trained health worker under the supervision of a qualified audiologist screened 425 neonates of whom 20 had confirmed severe-profound hearing impairment. Mechanical calibrated noisemakers of 50, 60, 70 and 80 dB (A) were used to elicit the behavioural responses. The parents of screened neonates were instructed to monitor the normal language and auditory milestones till 6 months of age. This strategy was validated against the reference standard consisting of a battery of tests - namely, auditory brain stem response (ABR), otoacoustic emissions (OAE) and behavioural assessment at 2 years of age. Bayesian prevalence weighted measures of screening were calculated. Results: The sensitivity and specificity was high with least false positive referrals for. 70 and 80 dB (A) noisemakers. All the noisemakers had 100 per cent negative predictive value. 70 and 80 dB (A) noisemakers had high positive likelihood ratios of 19 and 34, respectively. The probability differences for pre- and post- test positive was 43 and 58 for 70 and 80 dB (A) noisemakers, respectively. Interpretation & conclusions: In a controlled setting, health workers with primary education can be trained to use a mechanical calibrated noisemaker made of locally available material to reliably screen for severe-profound hearing loss in neonates. The monitoring of auditory responses could be done by informed parents. Multi-centre field trials of this strategy need to be carried out to examine the feasibility of community health care workers using it in resource constrained settings of developing nations to implement an effective national neonatal hearing screening programme.
Resumo:
Low-frequency sounds are advantageous for long-range acoustic signal transmission, but for small animals they constitute a challenge for signal detection and localization. The efficient detection of sound in insects is enhanced by mechanical resonance either in the tracheal or tympanal system before subsequent neuronal amplification. Making small structures resonant at low sound frequencies poses challenges for insects and has not been adequately studied. Similarly, detecting the direction of long-wavelength sound using interaural signal amplitude and/or phase differences is difficult for small animals. Pseudophylline bushcrickets predominantly call at high, often ultrasonic frequencies, but a few paleotropical species use lower frequencies. We investigated the mechanical frequency tuning of the tympana of one such species, Onomarchus uninotatus, a large bushcricket that produces a narrow bandwidth call at an unusually low carrier frequency of 3.2. kHz. Onomarchus uninotatus, like most bushcrickets, has two large tympanal membranes on each fore-tibia. We found that both these membranes vibrate like hinged flaps anchored at the dorsal wall and do not show higher modes of vibration in the frequency range investigated (1.5-20. kHz). The anterior tympanal membrane acts as a low-pass filter, attenuating sounds at frequencies above 3.5. kHz, in contrast to the high-pass filter characteristic of other bushcricket tympana. Responses to higher frequencies are partitioned to the posterior tympanal membrane, which shows maximal sensitivity at several broad frequency ranges, peaking at 3.1, 7.4 and 14.4. kHz. This partitioning between the two tympanal membranes constitutes an unusual feature of peripheral auditory processing in insects. The complex tracheal shape of O. uninotatus also deviates from the known tube or horn shapes associated with simple band-pass or high-pass amplification of tracheal input to the tympana. Interestingly, while the anterior tympanal membrane shows directional sensitivity at conspecific call frequencies, the posterior tympanal membrane is not directional at conspecific frequencies and instead shows directionality at higher frequencies.
Resumo:
In this paper, the authors study the structure of a novel binaural sound with a certain phase and amplitude modulation and the response to this excitation when it is applied to natural rewarding circuit of human brain through auditory neural pathways. This novel excitation, also referred to as gyrosonic excitation in this work, has been found to have interesting effects such as stabilization effects on the left and right hemispheric brain signaling as captured by Galvanic Skin Resistance (GSR) measurements, control of cardiac rhythms (observed from ECG signals), mitigation of psychosomatic syndrome, and mitigation of migraine pain. Experimental data collected from human subjects are presented, and these data are examined to categorize the extent of systems disorder and reinforcement reward due to the gyrosonic stimulus. A multi-path reduced-order model has been developed to analyze the GSR signals. The filtered results are indicative of complicated reinforcing reward patterns due to the gyrosonic stimulation when it is used as a control input for patients with psychosomatic and cardiac disorders.
Resumo:
We develop noise robust features using Gammatone wavelets derived from the popular Gammatone functions. These wavelets incorporate the characteristics of human peripheral auditory systems, in particular the spatially-varying frequency response of the basilar membrane. We refer to the new features as Gammatone Wavelet Cepstral Coefficients (GWCC). The procedure involved in extracting GWCC from a speech signal is similar to that of the conventional Mel-Frequency Cepstral Coefficients (MFCC) technique, with the difference being in the type of filterbank used. We replace the conventional mel filterbank in MFCC with a Gammatone wavelet filterbank, which we construct using Gammatone wavelets. We also explore the effect of Gammatone filterbank based features (Gammatone Cepstral Coefficients (GCC)) for robust speech recognition. On AURORA 2 database, a comparison of GWCCs and GCCs with MFCCs shows that Gammatone based features yield a better recognition performance at low SNRs.
Binaural Signal Processing Motivated Generalized Analytic Signal Construction and AM-FM Demodulation
Resumo:
Binaural hearing studies show that the auditory system uses the phase-difference information in the auditory stimuli for localization of a sound source. Motivated by this finding, we present a method for demodulation of amplitude-modulated-frequency-modulated (AM-FM) signals using a ignal and its arbitrary phase-shifted version. The demodulation is achieved using two allpass filters, whose impulse responses are related through the fractional Hilbert transform (FrHT). The allpass filters are obtained by cosine-modulation of a zero-phase flat-top prototype halfband lowpass filter. The outputs of the filters are combined to construct an analytic signal (AS) from which the AM and FM are estimated. We show that, under certain assumptions on the signal and the filter structures, the AM and FM can be obtained exactly. The AM-FM calculations are based on the quasi-eigenfunction approximation. We then extend the concept to the demodulation of multicomponent signals using uniform and non-uniform cosine-modulated filterbank (FB) structures consisting of flat bandpass filters, including the uniform cosine-modulated, equivalent rectangular bandwidth (ERB), and constant-Q filterbanks. We validate the theoretical calculations by considering application on synthesized AM-FM signals and compare the performance in presence of noise with three other multiband demodulation techniques, namely, the Teager-energy-based approach, the Gabor's AS approach, and the linear transduction filter approach. We also show demodulation results for real signals.