11 results for speech signals

at Duke University


Relevance:

70.00%

Publisher:

Abstract:

This dissertation focuses on two vital challenges in relation to whale acoustic signals: detection and classification.

In detection, we evaluated the influence of the uncertain ocean environment on the spectrogram-based detector, and derived the likelihood ratio of the proposed Short Time Fourier Transform detector. Experimental results showed that the proposed detector outperforms detectors based on the spectrogram. The proposed detector is more sensitive to environmental changes because it includes phase information.
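The role of phase can be illustrated with a small, generic sketch. It is not the dissertation's detector or its likelihood ratio. It only shows that a magnitude-only (spectrogram-style) view cannot distinguish a frame from a circularly shifted copy of itself, while the complex transform can. The frame length, tone frequency, and shift below are arbitrary assumptions.

```python
import cmath
import math

def dft(frame):
    """Plain O(n^2) discrete Fourier transform of one analysis frame."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]

# Hypothetical test frame: a 3-cycle cosine over 32 samples.
frame = [math.cos(2 * math.pi * 3 * t / 32) for t in range(32)]
# Circular shift by 4 samples: same magnitudes, different phases.
shifted = frame[4:] + frame[:4]

X, Y = dft(frame), dft(shifted)

# Largest difference seen by a magnitude-only representation.
mag_gap = max(abs(abs(a) - abs(b)) for a, b in zip(X, Y))
# Largest difference seen when phase is kept.
complex_gap = max(abs(a - b) for a, b in zip(X, Y))
```

Here `mag_gap` is zero up to rounding error while `complex_gap` is large, which is the sense in which a statistic built on the complex Short Time Fourier Transform carries information that a spectrogram discards.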

In classification, our focus is on finding a robust and sparse representation of whale vocalizations. Because whale vocalizations can be modeled as polynomial phase signals, we can represent the whale calls by their polynomial phase coefficients. In this dissertation, we used the Weyl transform to capture chirp rate information, and used a two-dimensional feature set to represent whale vocalizations globally. Experimental results showed that our Weyl feature set outperforms chirplet coefficients and Mel-frequency cepstral coefficients (MFCC) when applied to our collected data.
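As a rough illustration of what a polynomial phase coefficient is, the sketch below builds a noiseless second-order polynomial phase signal (a linear chirp) and recovers its chirp rate from the second-order phase difference. This is a simpler stand-in chosen for brevity, not the Weyl transform used in the dissertation. The function names, sampling rate, and signal parameters are assumptions.

```python
import cmath
import math

def make_chirp(f0, chirp_rate, fs, n_samples):
    """Complex linear chirp with phase 2*pi*(f0*t + 0.5*chirp_rate*t**2)."""
    out = []
    for n in range(n_samples):
        t = n / fs
        phase = 2.0 * math.pi * (f0 * t + 0.5 * chirp_rate * t * t)
        out.append(cmath.exp(1j * phase))
    return out

def estimate_chirp_rate(signal, fs):
    """Estimate the chirp rate (Hz/s) from the second-order phase difference."""
    # First difference: the phase of d1[n] is the per-sample phase increment.
    d1 = [signal[n + 1] * signal[n].conjugate() for n in range(len(signal) - 1)]
    # Second difference: for a linear chirp its phase is the constant
    # 2*pi*chirp_rate/fs**2, so averaging before taking the angle is safe.
    acc = sum(d1[n + 1] * d1[n].conjugate() for n in range(len(d1) - 1))
    return cmath.phase(acc) * fs * fs / (2.0 * math.pi)

# Hypothetical call: 50 Hz start frequency sweeping upward at 200 Hz/s.
chirp = make_chirp(f0=50.0, chirp_rate=200.0, fs=1000.0, n_samples=500)
estimated_rate = estimate_chirp_rate(chirp, fs=1000.0)
```

The estimate is only unambiguous while the magnitude of `2*pi*chirp_rate/fs**2` stays below pi, and real recordings would need a noise-robust estimator.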

Since whale vocalizations can be represented by polynomial phase coefficients, it is plausible that the signals lie on a manifold parameterized by these coefficients. We also studied the intrinsic structure of high-dimensional whale data by exploiting its geometry. Experimental results showed that nonlinear mappings such as Laplacian Eigenmap and ISOMAP outperform linear mappings such as PCA and MDS, suggesting that the whale acoustic data has nonlinear structure.

We also explored deep learning algorithms on whale acoustic data. We built each layer as convolutions with either a PCA filter bank (PCANet) or a DCT filter bank (DCTNet). With the DCT filter bank, each layer has a different time-frequency scale representation, and from this, one can extract different physical information. Experimental results showed that our PCANet and DCTNet achieve a high classification rate on the whale vocalization data set. The word error rate of the DCTNet feature is similar to that of MFSC features in speech recognition tasks, suggesting that the convolutional network is able to reveal the acoustic content of speech signals.
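A single layer of this kind can be sketched as a bank of DCT basis vectors slid along the signal. The code below is a minimal illustration under stated assumptions (orthonormal DCT-II filters, window length 8, hop of 1, no nonlinearity or pooling), not the DCTNet implementation evaluated in the dissertation.

```python
import math

def dct_filter_bank(window):
    """Return `window` orthonormal DCT-II basis vectors, each of length `window`."""
    bank = []
    for k in range(window):
        scale = math.sqrt(1.0 / window) if k == 0 else math.sqrt(2.0 / window)
        bank.append(
            [scale * math.cos(math.pi * (n + 0.5) * k / window) for n in range(window)]
        )
    return bank

def dct_layer(signal, window, hop=1):
    """Slide every DCT filter over the signal; out[k][m] is filter k at frame m."""
    bank = dct_filter_bank(window)
    starts = range(0, len(signal) - window + 1, hop)
    # Written in correlation form; a strict convolution would flip each filter.
    return [
        [sum(f[n] * signal[s + n] for n in range(window)) for s in starts]
        for f in bank
    ]

# Hypothetical input: a constant signal, which should excite only filter 0.
layer_out = dct_layer([1.0] * 16, window=8)
```

Feeding one output channel into a second call with a different `window` would give a coarser or finer time-frequency scale, which is one way to read the statement that each layer has its own scale.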

Relevance:

20.00%

Publisher:

Abstract:

During heart development, a subpopulation of cells in the heart field maintains cardiac potential over several days of development and forms the myocardium and smooth muscle of the arterial pole. Using clonal and explant culture experiments, we show that these cells are a stem cell population that can differentiate into myocardium, smooth muscle and endothelial cells. The multipotent stem cells proliferate or differentiate into different cardiovascular cell fates through activation or inhibition of FGF and BMP signaling pathways. BMP promoted myocardial differentiation but not proliferation. FGF signaling promoted proliferation and induced smooth muscle differentiation, but inhibited myocardial differentiation. Blocking the Ras/Erk intracellular pathway promoted myocardial differentiation, while the PLCgamma and PI3K pathways regulated proliferation. In vivo, inhibition of both pathways resulted in predictable arterial pole defects. These studies suggest that myocardial differentiation of arterial pole progenitors requires BMP signaling combined with downregulation of the FGF/Ras/Erk pathway. The FGF pathway maintains the pool of proliferating stem cells and later promotes smooth muscle differentiation.

Relevance:

20.00%

Publisher:

Abstract:

The affective impact of music arises from a variety of factors, including intensity, tempo, rhythm, and tonal relationships. The emotional coloring evoked by intensity, tempo, and rhythm appears to arise from association with the characteristics of human behavior in the corresponding condition; however, how and why particular tonal relationships in music convey distinct emotional effects are not clear. The hypothesis examined here is that major and minor tone collections elicit different affective reactions because their spectra are similar to the spectra of voiced speech uttered in different emotional states. To evaluate this possibility the spectra of the intervals that distinguish major and minor music were compared to the spectra of voiced segments in excited and subdued speech using fundamental frequency and frequency ratios as measures. Consistent with the hypothesis, the spectra of major intervals are more similar to spectra found in excited speech, whereas the spectra of particular minor intervals are more similar to the spectra of subdued speech. These results suggest that the characteristic affective impact of major and minor tone collections arises from associations routinely made between particular musical intervals and voiced speech.
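To make "fundamental frequency and frequency ratios" concrete, the sketch below takes a few intervals that differ between major and minor tone collections and computes, for a two-tone dyad, the highest frequency of which both tones are integer harmonics. The just-intonation ratios, the 240 Hz lower tone, and the function name are assumptions for illustration; the abstract does not specify the tuning or the exact measure used.

```python
from fractions import Fraction

# Just-intonation ratios (an assumption; the source does not name a tuning system).
INTERVALS = {
    "major third": Fraction(5, 4),
    "minor third": Fraction(6, 5),
    "major sixth": Fraction(5, 3),
    "minor sixth": Fraction(8, 5),
}

def implied_fundamental(lower_hz, ratio):
    """Highest frequency of which both tones of the dyad are integer harmonics."""
    # With upper = lower * p/q in lowest terms, the lower tone is harmonic q
    # and the upper tone is harmonic p of the frequency lower/q.
    return Fraction(lower_hz) / ratio.denominator

# Hypothetical lower tone of 240 Hz.
fundamentals = {
    name: implied_fundamental(240, ratio) for name, ratio in INTERVALS.items()
}
```

With these assumed ratios, the major third on 240 Hz implies a fundamental of 60 Hz and the minor third one of 48 Hz, so the two interval types map to different fundamental frequencies even over the same lower tone.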

Relevance:

20.00%

Publisher:

Abstract:

β-arrestins are versatile adapter proteins that form complexes with most G-protein-coupled receptors (GPCRs) following agonist binding and phosphorylation of receptors by G-protein-coupled receptor kinases (GRKs). They play a central role in the interrelated processes of homologous desensitization and GPCR sequestration, which lead to the termination of G protein activation. β-arrestin binding to GPCRs both uncouples receptors from heterotrimeric G proteins and targets them to clathrin-coated pits for endocytosis. Recent data suggest that β-arrestins also function as GPCR signal transducers. They can form complexes with several signaling proteins, including Src family tyrosine kinases and components of the ERK1/2 and JNK3 MAP kinase cascades. By recruiting these kinases to agonist-occupied GPCRs, β-arrestins confer distinct signaling activities upon the receptor. β-arrestin-Src complexes have been proposed to modulate GPCR endocytosis, to trigger ERK1/2 activation and to mediate neutrophil degranulation. By acting as scaffolds for the ERK1/2 and JNK3 cascades, β-arrestins both facilitate GPCR-stimulated MAP kinase activation and target active MAP kinases to specific locations within the cell. Thus, their binding to GPCRs might initiate a second wave of signaling and represent a novel mechanism of GPCR signal transduction.

Relevance:

20.00%

Publisher:

Abstract:

Perceiving or producing complex vocalizations such as speech and birdsongs requires the coordinated activity of neuronal populations, and these activity patterns can vary over space and time. How learned communication signals are represented by populations of sensorimotor neurons essential to vocal perception and production remains poorly understood. Using a combination of two-photon calcium imaging, intracellular electrophysiological recording and retrograde tracing methods in anesthetized adult male zebra finches (Taeniopygia guttata), I addressed how the bird's own song and its component syllables are represented by the spatiotemporal patterns of activity of two spatially intermingled populations of projection neurons (PNs) in HVC, a sensorimotor area required for song perception and production. These experiments revealed that neighboring PNs can respond at markedly different times to song playback and that different syllables activate spatially intermingled HVC PNs within a small region. Moreover, noise correlation analysis reveals enhanced functional connectivity between PNs that respond most strongly to the same syllable and also provides evidence of a spatial gradient of functional connectivity specific to PNs that project to song motor nucleus (i.e. HVCRA cells). These findings support a model in which syllabic and temporal features of song are represented by spatially intermingled PNs functionally organized into cell- and syllable-type networks.

Relevance:

20.00%

Publisher:

Abstract:

Thirty years after fleeing from Poland to Denmark, 20 immigrants were enlisted in a study of bilingual autobiographical memory. Ten "early immigrators" averaged 24 years old at the time of immigration, and ten "late immigrators" averaged 34 years old at immigration. Although all 20 had spent 30 years in Denmark, early immigrators reported more current inner speech behaviours in Danish, whereas late immigrators showed more use of Polish. Both groups displayed proportionally more numerous autobiographical retrievals that were reported as coming to them internally in Polish (vs Danish) for the decades prior to immigration and more in Danish (vs Polish) after immigration. We propose a culture- and language-specific shaping of semantic and conceptual stores that underpins autobiographical and world knowledge.

Relevance:

20.00%

Publisher:

Abstract:

We perceive a stable visual world even though saccades often move our retinas. One way the brain may achieve a stable visual percept is through predictive remapping of visual receptive fields: just before a saccade, the receptive field of many neurons moves from its current location ("current receptive field") to the location it is expected to occupy after the saccade ("future receptive field"). Goldberg and colleagues found such remapping in cortical areas, e.g. in the frontal eye field (FEF), as well as in the intermediate layers of the superior colliculus (SC). In the present study we investigated the source of the SC's remapped visual signals. Do some of them come from the FEF? We identified FEF neurons that project to the SC using antidromic stimulation. For neurons with a visual response, we tested whether the receptive field shifted just prior to making a saccade. Saccadic amplitudes were chosen to be as small as possible while clearly separating the current and future receptive fields; they ranged from 5-30 deg. in amplitude and were directed contraversively. The saccadic target was a small red spot. We probed visual responsiveness at the current and future receptive field locations using a white spot flashed at various times before or after the saccade. Predictive remapping was indicated by a visual response to a probe flashed in the future receptive field just before the saccade began. We found that many FEF neurons projecting to the SC exhibited predictive remapping. Moreover, the remapping was as fast and strong as any previously reported for FEF or SC. It is clear, therefore, that remapped visual signals are sent from FEF to SC, providing direct evidence that the FEF is one source of the SC's remapped visual signals. Because remapping requires information about an imminent saccade, we hypothesize that remapping in FEF depends on corollary discharge signals such as those ascending from the SC through MD thalamus (Sommer and Wurtz 2002).

Relevance:

20.00%

Publisher:

Abstract:

Animals communicating via scent often deposit composite signals that incorporate odorants from multiple sources; however, the function of mixing chemical signals remains understudied. We tested both a 'multiple-messages' and a 'fixative' hypothesis of composite olfactory signalling, which, respectively, posit that mixing scents functions to increase information content or prolong signal longevity. Our subjects, adult male ring-tailed lemurs (Lemur catta), have a complex scent-marking repertoire, involving volatile antebrachial (A) secretions, deposited pure or after being mixed with a squalene-rich paste exuded from brachial (B) glands. Using behavioural bioassays, we examined recipient responses to odorants collected from conspecific strangers. We concurrently presented pure A, pure B and mixed A + B secretions, in fresh or decayed conditions. Lemurs preferentially responded to mixed over pure secretions, their interest increasing and shifting over time, from sniffing and countermarking fresh mixtures, to licking and countermarking decayed mixtures. Substituting synthetic squalene (S), a well-known fixative, for B secretions did not replicate prior results: B secretions, which contain additional chemicals that probably encode salient information, were preferred over pure S. Whereas support for the 'multiple-messages' hypothesis underscores the unique contribution from each of an animal's various secretions, support for the 'fixative' hypothesis highlights the synergistic benefits of composite signals.