873 results for Audio-Visual Automatic Speech Recognition
Abstract:
Digital society embraces us in every aspect of daily life, and a significant share of the population lives connected across multiple platforms. With the instantaneity of communication flows, we live a routine in which much access is just a click or a tap away. Television, the predominant medium for several decades, takes on in its digital transition a role beyond the TV we used to know: an interactive display that connects to and absorbs content from various sources. The established worldwide models of audiovisual distribution, especially via Broadcast, are suffering the consequences of the change in their audience's behaviour brought about by new opportunities to access content, now interactive and on demand. In this context, SmartTV (connected TV) models on Broadband offer differentiated options and demand an ever larger space in the connection with all other displays. Against this backdrop, the present study seeks to describe and analyse the new offerings of content, applications, possibilities and trends in the hybridization of sources for the future TV.
Abstract:
This paper reviews some basic issues and methods involved in using neural networks to respond in a desired fashion to a temporally-varying environment. Some popular network models and training methods are introduced. A speech recognition example is then used to illustrate the central difficulty of temporal data processing: learning to notice and remember relevant contextual information. Feedforward network methods are applicable to cases where this problem is not severe. The application of these methods is explained, and applications are discussed in the areas of pure mathematics, chemical and physical systems, and economic systems. A more powerful but less practical algorithm for temporal problems, the moving targets algorithm, is sketched and discussed. For completeness, a few remarks are made on reinforcement learning.
Abstract:
Both attentional difficulties and rapid processing deficits have recently been linked with dyslexia. We report two studies comparing the performance of dyslexic and control teenagers on attentional tasks. The two studies were based on two different conceptions of attention. Study 1 employed a design that allowed three key components of attention - focusing, switching, and sustaining - to be investigated separately. One hypothesis under investigation was that rapid processing problems - in particular impaired ability to switch attention rapidly - might be associated with dyslexia. However, although dyslexic participants were significantly less accurate than their controls in a condition where they had to switch attention between two target types, the nature of the deficit suggested that the problem was not in switching attention per se. Thus, in Study 2, we explored an alternative interpretation of the Study 1 results in terms of the classic capacity-limited models of "central" attention. We contrasted two hypotheses: (1) that dyslexic teenagers have reduced cognitive resources versus (2) that they suffer from a general impairment in the ability to automatise basic skills. To investigate the automaticity of the shape recognition component of the task a similar attention paradigm to that used in Study 1 was employed, but using degraded, as well as intact, stimuli. It was found that stimulus degradation led to relatively less impairment for dyslexic than for matched control groups. The results support the hypothesis that dyslexic people suffer from a general impairment in the ability to automatise skills - in this case the skill of automatic shape recognition.
Abstract:
This chapter provides the theoretical foundation and background on the data envelopment analysis (DEA) method. We first introduce the basic DEA models. The balance of this chapter focuses on evidence showing that DEA has been extensively applied for measuring the efficiency and productivity of services including financial services (banking, insurance, securities, and fund management), professional services, health services, education services, environmental and public services, energy services, logistics, tourism, information technology, telecommunications, transport, distribution, audio-visual, media, entertainment, cultural and other business services. Finally, we provide information on the use of Performance Improvement Management Software (PIM-DEA). A free limited version of this software and the downloading procedure are also included in this chapter.
Abstract:
Keyword identification in one of two simultaneous sentences is improved when the sentences differ in F0, particularly when they are almost continuously voiced. Sentences of this kind were recorded, monotonised using PSOLA, and re-synthesised to give a range of harmonic ΔF0s (0, 1, 3, and 10 semitones). They were additionally re-synthesised by LPC with the LPC residual frequency shifted by 25% of F0, to give excitation with inharmonic but regularly spaced components. Perceptual identification of frequency-shifted sentences showed a similar large improvement with nominal ΔF0 as seen for harmonic sentences, although overall performance was about 10% poorer. We compared performance with that of two autocorrelation-based computational models comprising four stages: (i) peripheral frequency selectivity and half-wave rectification; (ii) within-channel periodicity extraction; (iii) identification of the two major peaks in the summary autocorrelation function (SACF); (iv) a template-based approach to speech recognition using dynamic time warping. One model sampled the correlogram at the target-F0 period and performed spectral matching; the other deselected channels dominated by the interferer and performed matching on the short-lag portion of the residual SACF. Both models reproduced the monotonic increase observed in human performance with increasing ΔF0 for the harmonic stimuli, but not for the frequency-shifted stimuli. A revised version of the spectral-matching model, which groups patterns of periodicity that lie on a curve in the frequency-delay plane, showed a closer match to the perceptual data for frequency-shifted sentences. The results extend the range of phenomena originally attributed to harmonic processing to grouping by common spectral pattern.
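Stage (iv) of the models described in this abstract, template matching by dynamic time warping, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `dtw_distance` and `recognise`, the use of one-dimensional feature sequences, and the absolute-difference local cost are all assumptions made for illustration. A real recogniser would compare frames of spectral features.

```python
import math


def dtw_distance(a, b):
    """Accumulated dynamic-time-warping cost between two 1-D sequences.

    Illustrative assumption: the local cost is the absolute difference
    between scalar samples.
    """
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal accumulated cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Allowed steps: advance in a only, in b only, or in both.
            cost[i][j] = local + min(cost[i - 1][j],
                                     cost[i][j - 1],
                                     cost[i - 1][j - 1])
    return cost[n][m]


def recognise(query, templates):
    """Return the label of the stored template with the lowest DTW cost."""
    return min(templates, key=lambda label: dtw_distance(query, templates[label]))


# Hypothetical templates: a rising contour and a flat contour.
templates = {"rise": [1, 2, 3], "flat": [2, 2, 2]}
best = recognise([1, 2, 2, 3], templates)
```

In this sketch the query `[1, 2, 2, 3]` is a time-stretched copy of the "rise" template, so warping aligns the two at zero cost and `best` is `"rise"`.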
Abstract:
Much has been written about the marketing aspects of promotional material in general, and several scholars (particularly in linguistics) have addressed questions relating to the structure and function of advertisements, focusing on images, rhetorical structure, semiotic functions, discourse features and audio-visual media, amongst other aspects of the genre. Not much, on the other hand, has been written within translation studies about the complexities involved in the transfer of an advertising message. Contributors to this volume explore various interdependent aspects of the interlingual and intercultural transfer of an advertising message. They emphasize features of culture specificity, of multi-medial semiotic interaction, of values and stereotypes, and most importantly, they recommend strategies and approaches to assist translators. Topics covered include a critique of the Western-based approach to advertising in the context of the Far East; different perceptions of the concept of cleanliness in advertising texts in Italy, Russia and the UK; the Walls Cornetto strategy of internationalization of product appeal, followed by localization; the role of the translator in recreating appeal in different lingua-cultural contexts; what constitutes 'Italianness' in advertisements for British consumers; and strategies for repackaging France as a tourist destination.
Abstract:
Modern technology has moved on and completely changed the way that people can use the telephone or mobile to dialogue with information held on computers. Well-developed “written speech analysis” does not work with “verbal speech”. The main purpose of our article is, firstly, to highlight the problems and, secondly, to show possible ways to solve them.
Abstract:
The article gives an account of the various microfilming initiatives taken in Malta during the last thirty years. Various archives have managed to microfilm their holdings under co-operation agreements with international societies, or manuscript libraries. The advent of digital technology is now posing new challenges and opportunities for the archives sector. The idea of a National Memory Project that will try to bridge the different approaches in the preservation of records in the various public, private, and ecclesiastical archives in Malta is discussed. Technical challenges are highlighted, as are the opportunities that arise from collaboration and active participation in international projects such as the European Visual Archives (EVA), and the SEEDI initiative.
Abstract:
Drawing on the newest findings of politeness research, this paper proposes an interactionally grounded approach to computer-mediated discourse (CMD). Through the analysis of naturally occurring text-based synchronous interactions of a virtual team the paper illustrates that the interactional politeness approach can account for linguistic phenomena not yet fully explored in computer-mediated discourse analysis. Strategies used for compensating for the lack of audio-visual information in computer-mediated communication, strategies to compensate for the technological constraints of the medium, and strategies to aid interaction management are examined from an interactional politeness viewpoint and compared to the previous findings of CMD analysis. The conclusion of this preliminary research suggests that the endeavour to communicate along the lines of politeness norms in a work-based virtual environment contradicts some of the previous findings of CMD research (unconventional orthography, capitalization, economizing), and that other areas (such as emoticons, backchannel signals and turn-taking strategies) need to be revisited and re-examined from an interactional perspective to fully understand how language functions in this merely text-based environment.
Abstract:
It is already a truism that emerging communication technologies have changed the landscape of communication in every aspect of our lives, but this is especially true for how we communicate at work. Advances in communication technologies have enabled a wide range of digital communication modes to be utilized for both internal and external business communication, including audio and visual communication and voice-over protocols, as well as text-based channels such as email, forums, instant messaging and social media. In spite of the wide range of available audio-visual channels, and despite the ever-increasing popularity of email, real-time text-based communication technologies (instant messaging or IM) are also on the rise (see Mak, 2014; Pazos et al., 2013; Radicati & Levenstein, 2013; and Markman in this volume). The prominence of IM is evident in the rise of this mode of communication, not only as a tool for internal business communication, but as a front-stage channel, particularly for customer service encounters or professional-client conversations (Makarem et al., 2009; Pearce et al., 2013; L. Zhang et al., 2011).
Abstract:
Online writing plays a complex and increasingly prominent role in the life of organizations. From newsletters to press releases, social media marketing and advertising, to virtual presentations and interactions via e-mail and instant messaging, digital writing intertwines and affects the day-to-day running of the company - yet we rarely pay enough attention to it. Typing on the screen can become particularly problematic because digital text-based communication increases the opportunities for misunderstanding: it lacks the direct audio-visual contact and the norms and conventions that would normally help people to understand each other. Providing a clear, convincing and approachable discussion, this book addresses arenas of online writing: virtual teamwork, instant messaging, emails, corporate communication channels, and social media. Instead of offering do and don’t lists, however, it teaches the reader to develop a practice that is observant, reflective, and grounded in the understanding of the basic principles of language and communication. Through real-life examples and case studies, it helps the reader to notice previously unnoticed small details, question previously unchallenged assumptions and practices, and become a competent digital communicator in a wide range of professional contexts.
Abstract:
This study explored the critical features of temporal synchrony for the facilitation of prenatal perceptual learning with respect to unimodal stimulation using an animal model, the bobwhite quail. The following related hypotheses were examined: (1) the availability of temporal synchrony is a critical feature to facilitate prenatal perceptual learning, (2) a single temporally synchronous note is sufficient to facilitate prenatal perceptual learning, with respect to unimodal stimulation, and (3) in situations where embryos are exposed to a single temporally synchronous note, facilitated perceptual learning, with respect to unimodal stimulation, will be optimal when the temporally synchronous note occurs at the onset of the stimulation bout. To assess these hypotheses, two experiments were conducted in which quail embryos were exposed to various audio-visual configurations of a bobwhite maternal call and tested at 24 hr after hatching for evidence of facilitated prenatal perceptual learning with respect to unimodal stimulation. Experiment 1 explored if intermodal equivalence was sufficient to facilitate prenatal perceptual learning with respect to unimodal stimulation. A Bimodal Sequential Temporal Equivalence (BSTE) condition was created that provided embryos with sequential auditory and visual stimulation in which the same amodal properties (rate, duration, rhythm) were made available across modalities. Experiment 2 assessed: (a) whether a limited number of temporally synchronous notes are sufficient for facilitated prenatal perceptual learning with respect to unimodal stimulation, and (b) whether there is a relationship between timing of occurrence of a temporally synchronous note and the facilitation of prenatal perceptual learning. Results revealed that prenatal exposure to BSTE was not sufficient to facilitate perceptual learning. 
In contrast, a maternal call that contained a single temporally synchronous note was sufficient to facilitate embryos’ prenatal perceptual learning with respect to unimodal stimulation. Furthermore, the most salient prenatal condition was that which contained the synchronous note at the onset of the call burst. Embryos’ prenatal perceptual learning of the call was four times faster in this condition than when exposed to a unimodal call. Taken together, bobwhite quail embryos’ remarkable sensitivity to temporal synchrony suggests that this amodal property plays a key role in attention and learning during prenatal development.
Abstract:
Context: Clinicians use exercises in rehabilitation to enhance sensorimotor function; however, evidence supporting their use is scarce. Objective: To evaluate acute effects of handheld vibration on joint position sense (JPS). Design: A repeated-measures, randomized, counterbalanced 3-condition design. Setting: Sports Medicine and Science Research Laboratory. Patients or Other Participants: 31 healthy college-aged volunteers (16 males, 15 females; age=23±3 y, mass=76±14 kg, height=173±8 cm). Interventions: We measured elbow JPS and monitored training using the Flock-of-Birds system (Ascension Technology, Burlington, VT) and MotionMonitor software (Innsport, Chicago, IL), accurate to 0.5°. For each condition (15, 5, and 0 Hz vibration), subjects completed three 15-s bouts holding a 2.55 kg Mini-VibraFlex dumbbell (Orthometric, New York, NY), and used software-generated audio/visual biofeedback to locate the target. Participants performed separate pre- and post-test JPS measures for each condition. For JPS testing, subjects held a non-vibrating dumbbell, identified the target (90° flexion) using biofeedback, and relaxed for 3-5 s. We removed feedback, and subjects recreated the target and pressed a trigger. We used SPSS 14.0 (SPSS Inc., Chicago, IL) to perform separate ANOVAs (p<0.05) for each protocol and calculated effect sizes using standardized mean differences. Main Outcome Measures: Dependent variables were absolute and variable error between target and reproduced angles, pre-post vibration training. Results: 0 Hz (F(1,61)=1.310, p=0.3) and 5 Hz (F(1,61)=2.625, p=0.1) vibration did not affect accuracy. 15 Hz vibration enhanced accuracy (6.5±0.6 to 5.0±0.5°) (F(1,61)=8.681, p=0.005, ES=0.3). 0 Hz did not affect variability (F(1,61)=0.007, p=0.9). 5 Hz vibration decreased variability (3.0±1.8 to 2.3±1.3°) (F(1,61)=7.250, p=0.009), as did 15 Hz (2.8±1.8 to 1.8±1.2°) (F(1,61)=24.027, p<0.001). Conclusions: Our results support using handheld vibration to improve sensorimotor function.
Future research should include injured subjects, functional multi-joint/multi-planar measures, and long-term effects of similar training.