861 resultados para speech delay


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Dissertação apresentada na Faculdade de Ciências e Tecnologia da Universidade Nova de Lisboa para obtenção do grau de Mestre em Engenharia Electrotécnica e Computadores

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In the last decade, both scientific community and automotive industry enabled communications among vehicles in different kinds of scenarios proposing different vehicular architectures. Vehicular delay-tolerant networks (VDTNs) were proposed as a solution to overcome some of the issues found in other vehicular architectures, namely, in dispersed regions and emergency scenarios. Most of these issues arise from the unique characteristics of vehicular networks. Contrary to delay-tolerant networks (DTNs), VDTNs place the bundle layer under the network layer in order to simplify the layered architecture and enable communications in sparse regions characterized by long propagation delays, high error rates, and short contact durations. However, such characteristics turn contacts very important in order to exchange as much information as possible between nodes at every contact opportunity. One way to accomplish this goal is to enforce cooperation between network nodes. To promote cooperation among nodes, it is important that nodes share their own resources to deliver messages from others. This can be a very difficult task, if selfish nodes affect the performance of cooperative nodes. This paper studies the performance of a cooperative reputation system that detects, identify, and avoid communications with selfish nodes. Two scenarios were considered across all the experiments enforcing three different routing protocols (First Contact, Spray and Wait, and GeoSpray). For both scenarios, it was shown that reputation mechanisms that punish aggressively selfish nodes contribute to increase the overall network performance.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper explores the calculation of fractional integrals by means of the time delay operator. The study starts by reviewing the memory properties of fractional operators and their relationship with time delay. Based on the time response of the Mittag-Leffler function an approximation of fractional integrals consisting of time delayed samples is proposed. The tuning of the approximation is optimized by means of a genetic algorithm. The results demonstrate the feasibility of the new perspective and the limits of their application.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

As the wireless cellular market reaches competitive levels never seen before, network operators need to focus on maintaining Quality of Service (QoS) a main priority if they wish to attract new subscribers while keeping existing customers satisfied. Speech Quality as perceived by the end user is one major example of a characteristic in constant need of maintenance and improvement. It is in this topic that this Master Thesis project fits in. Making use of an intrusive method of speech quality evaluation, as a means to further study and characterize the performance of speech codecs in second-generation (2G) and third-generation (3G) technologies. Trying to find further correlation between codecs with similar bit rates, along with the exploration of certain transmission parameters which may aid in the assessment of speech quality. Due to some limitations concerning the audio analyzer equipment that was to be employed, a different system for recording the test samples was sought out. Although the new designed system is not standard, after extensive testing and optimization of the system's parameters, final results were found reliable and satisfactory. Tests include a set of high and low bit rate codecs for both 2G and 3G, where values were compared and analysed, leading to the outcome that 3G speech codecs perform better, under the approximately same conditions, when compared with 2G. Reinforcing the idea that 3G is, with no doubt, the best choice if the costumer looks for the best possible listening speech quality. Regarding the transmission parameters chosen for the experiment, the Receiver Quality (RxQual) and Received Energy per Chip to the Power Density Ratio (Ec/N0), these were subject to speech quality correlation tests. Final results of RxQual were compared to those of prior studies from different researchers and, are considered to be of important relevance. Leading to the confirmation of RxQual as a reliable indicator of speech quality. As for Ec/N0, it is not possible to state it as a speech quality indicator however, it shows clear thresholds for which the MOS values decrease significantly. The studied transmission parameters show that they can be used not only for network management purposes but, at the same time, give an expected idea to the communications engineer (or technician) of the end-to-end speech quality consequences. With the conclusion of the work new ideas for future studies come to mind. Considering that the fourth-generation (4G) cellular technologies are now beginning to take an important place in the global market, as the first all-IP network structure, it seems of great relevance that 4G speech quality should be subject of evaluation. Comparing it to 3G, not only in narrowband but also adding wideband scenarios with the most recent standard objective method of speech quality assessment, POLQA. Also, new data found on Ec/N0 tests, justifies further research studies with the intention of validating the assumptions made in this work.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Slowed atrial conduction may contribute to reentry circuits and vulnerability for atrial fibrillation (AF). The autonomic nervous system (ANS) has modulating effects on electrophysiological properties. However, complex interactions of the ANS with the arrhythmogenic substrate make it difficult to understand the mechanisms underlying induction and maintenance of AF. AIM: To determine the effect of acute ANS modulation in atrial activation times in patients (P) with paroxysmal AF (PAF). METHODS AND RESULTS: 16P (9 men; 59±14years) with PAF, who underwent electrophysiological study before AF ablation, and 15P (7 men; 58±11years) with atrioventricular nodal reentry tachycardia, without documentation or induction of AF (control group). Each group included 7P with arterial hypertension but without underlying structural heart disease. The study was performed while off drugs. Multipolar catheters were placed at the high right atrium (HRA), right atrial appendage (RAA), coronary sinus (CS) and His bundle area (His). At baseline and with HRA pacing (600ms, shortest propagated S2) we measured: i) intra-atrial conduction time (IACT, between RAA and atrial deflection in the distal His), ii) inter-atrial conduction time (interACT, between RAA and distal CS), iii) left atrial activation time (LAAT, between atrial deflection in the distal His and distal CS), iv) bipolar electrogram duration at four atrial sites (RAA, His, proximal and distal CS). In the PAF group, measurements were also determined during handgrip and carotid sinus massage (CSM), and after pharmacological blockade of the ANS (ANSB). AF was induced by HRA programmed stimulation in 56% (self-limited - 6; sustained - 3), 68.8% (self-limited - 6; sustained - 5), and 50% (self-limited - 5; sustained - 3) of the P, in basal, during ANS maneuvers, and after ANSB, respectively (p=NS). IACT, interACT and LAAT significantly lengthened during HRA pacing in both groups (600ms, S2). P with PAF have longer IACT (p<0.05), a higher increase in both IACT, interACT (p<0.01) and electrograms duration (p<0.05) with S2, and more fragmented activity, compared with the control group. Atrial conduction times and electrograms duration were not significantly changed during ANS stimulation. Nevertheless, ANS maneuvers increased heterogeneity of the local electrograms duration. Also, P with sustained AF showed longer interACT and LAAT during CSM. CONCLUSION: Atrial conduction times, electrograms duration and fractionated activity are increased in PAF, suggesting a role for conduction delays in the arrhythmogenic substrate. Acute vagal stimulation is associated with prolonged interACT and LAAT in P with inducible sustained AF and ANS modulation may influence the heterogeneity of atrial electrograms duration.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this work an adaptive modeling and spectral estimation scheme based on a dual Discrete Kalman Filtering (DKF) is proposed for speech enhancement. Both speech and noise signals are modeled by an autoregressive structure which provides an underlying time frame dependency and improves time-frequency resolution. The model parameters are arranged to obtain a combined state-space model and are also used to calculate instantaneous power spectral density estimates. The speech enhancement is performed by a dual discrete Kalman filter that simultaneously gives estimates for the models and the signals. This approach is particularly useful as a pre-processing module for parametric based speech recognition systems that rely on spectral time dependent models. The system performance has been evaluated by a set of human listeners and by spectral distances. In both cases the use of this pre-processing module has led to improved results.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Speech interfaces for Assistive Technologies are not common and are usually replaced by others. The market they are targeting is not considered attractive and speech technologies are still not well spread. Industry still thinks they present some performance risks, especially Speech Recognition systems. As speech is the most elemental and natural way for communication, it has strong potential for enhancing inclusion and quality of life for broader groups of users with special needs, such as people with cerebral palsy and elderly staying at their homes. This work is a position paper in which the authors argue for the need to make speech become the basic interface in assistive technologies. Among the main arguments, we can state: speech is the easiest way to interact with machines; there is a growing market for embedded speech in assistive technologies, since the number of disabled and elderly people is expanding; speech technology is already mature to be used but needs adaptation to people with special needs; there is still a lot of R&D to be done in this area, especially when thinking about the Portuguese market. The main challenges are presented and future directions are proposed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, a rule-based automatic syllabifier for Danish is described using the Maximal Onset Principle. Prior success rates of rule-based methods applied to Portuguese and Catalan syllabification modules were on the basis of this work. The system was implemented and tested using a very small set of rules. The results gave rise to 96.9% and 98.7% of word accuracy rate, contrary to our initial expectations, being Danish a language with a complex syllabic structure and thus difficult to be rule-driven. Comparison with data-driven syllabification system using artificial neural networks showed a higher accuracy rate of the former system.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The recent developments on Hidden Markov Models (HMM) based speech synthesis showed that this is a promising technology fully capable of competing with other established techniques. However some issues still lack a solution. Several authors report an over-smoothing phenomenon on both time and frequencies which decreases naturalness and sometimes intelligibility. In this work we present a new vowel intelligibility enhancement algorithm that uses a discrete Kalman filter (DKF) for tracking frame based parameters. The inter-frame correlations are modelled by an autoregressive structure which provides an underlying time frame dependency and can improve time-frequency resolution. The system’s performance has been evaluated using objective and subjective tests and the proposed methodology has led to improved results.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this work an adaptive filtering scheme based on a dual Discrete Kalman Filtering (DKF) is proposed for Hidden Markov Model (HMM) based speech synthesis quality enhancement. The objective is to improve signal smoothness across HMMs and their related states and to reduce artifacts due to acoustic model's limitations. Both speech and artifacts are modelled by an autoregressive structure which provides an underlying time frame dependency and improves time-frequency resolution. Themodel parameters are arranged to obtain a combined state-space model and are also used to calculate instantaneous power spectral density estimates. The quality enhancement is performed by a dual discrete Kalman filter that simultaneously gives estimates for the models and the signals. The system's performance has been evaluated using mean opinion score tests and the proposed technique has led to improved results.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Surveillance registers monitor the prevalence of cerebral palsy and the severity of resulting impairments across time and place. The motor disorders of cerebral palsy can affect children’s speech production and limit their intelligibility. We describe the development of a scale to classify children’s speech performance for use in cerebral palsy surveillance registers, and its reliability across raters and across time. Speech and language therapists, other healthcare professionals and parents classified the speech of 139 children with cerebral palsy (85 boys, 54 girls; mean age 6.03 years, SD 1.09) from observation and previous knowledge of the children. Another group of health professionals rated children’s speech from information in their medical notes. With the exception of parents, raters reclassified children’s speech at least four weeks after their initial classification. Raters were asked to rate how easy the scale was to use and how well the scale described the child’s speech production using Likert scales. Inter-rater reliability was moderate to substantial (k > .58 for all comparisons). Test–retest reliability was substantial to almost perfect for all groups (k > .68). Over 74% of raters found the scale easy or very easy to use; 66% of parents and over 70% of health care professionals judged the scale to describe children’s speech well or very well. We conclude that the Viking Speech Scale is a reliable tool to describe the speech performance of children with cerebral palsy, which can be applied through direct observation of children or through case note review.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This dissertation analyzes the possibilities of utilizing speech-processing technologies to transform the user experience of ActivoBank’s customers while using remote banking solutions. The technologies are examined through different criteria to determine if they support the bank’s goals and strategy and whether they should be incorporated in the bank’s offering. These criteria include the alignment with ActivoBank’s values, the suitability of the technology providers, the benefits these technologies entail, potential risks, appeal to the customers and impact on customer satisfaction. The analysis suggests that ActivoBank might not be in a position to adopt these technologies at this point in time.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Graphics based systems of Augmented and Alternative Communication are widely used to promote communication in people with Autism Spectrum Disorders. This study discusses an integration of Augmented Reality in communication interventions, by relating elements of Augmented and Alternative Communication and Applied Behaviour Analysis strategies. An architecture for an Augmented Reality based interactive system to assist interventions is proposed. STAR provides an Augmented Reality tool to assist interventions performed by therapists and support for parents to join in and participate in the child’s intervention. Finally we report on the usage of the Augmented Reality tool in interventions with children with Autism Spectrum Disorders.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

One of the major challenges in the development of an immersive system is handling the delay between the tracking of the user’s head position and the updated projection of a 3D image or auralised sound, also called end-to-end delay. Excessive end-to-end delay can result in the general decrement of the “feeling of presence”, the occurrence of motion sickness and poor performance in perception-action tasks. These latencies must be known in order to provide insights on the technological (hardware/software optimization) or psychophysical (recalibration sessions) strategies to deal with them. Our goal was to develop a new measurement method of end-to-end delay that is both precise and easily replicated. We used a Head and Torso simulator (HATS) as an auditory signal sensor, a fast response photo-sensor to detect a visual stimulus response from a Motion Capture System, and a voltage input trigger as real-time event. The HATS was mounted in a turntable which allowed us to precisely change the 3D sound relative to the head position. When the virtual sound source was at 90º azimuth, the correspondent HRTF would set all the intensity values to zero, at the same time a trigger would register the real-time event of turning the HATS 90º azimuth. Furthermore, with the HATS turned 90º to the left, the motion capture marker visualization would fell exactly in the photo-sensor receptor. This method allowed us to precisely measure the delay from tracking to displaying. Moreover, our results show that the method of tracking, its tracking frequency, and the rendering of the sound reflections are the main predictors of end-to-end delay.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Buruli Ulcer (BU) is a neglected infectious disease caused by Mycobacterium ulcerans that is responsible for severe necrotizing cutaneous lesions that may be associated with bone involvement. Clinical presentations of BU lesions are classically classified as papules, nodules, plaques and edematous infiltration, ulcer or osteomyelitis. Within these different clinical forms, lesions can be further classified as severe forms based on focality (multiple lesions), lesions' size (>15 cm diameter) or WHO Category (WHO Category 3 lesions). There are studies reporting an association between delay in seeking medical care and the development of ulcerative forms of BU or osteomyelitis, but the effect of time-delay on the emergence of lesions classified as severe has not been addressed. To address both issues, and in a cohort of laboratory-confirmed BU cases, 476 patients from a medical center in Allada, Benin, were studied. In this laboratory-confirmed cohort, we validated previous observations, demonstrating that time-delay is statistically related to the clinical form of BU. Indeed, for non-ulcerated forms (nodule, edema, and plaque) the median time-delay was 32.5 days (IQR 30.0-67.5), while for ulcerated forms it was 60 days (IQR 20.0-120.0) (p = 0.009), and for bone lesions, 365 days (IQR 228.0-548.0). On the other hand, we show here that time-delay is not associated with the more severe phenotypes of BU, such as multi-focal lesions (median 90 days; IQR 56-217.5; p = 0.09), larger lesions (diameter >15 cm) (median 60 days; IQR 30-120; p = 0.92) or category 3 WHO classification (median 60 days; IQR 30-150; p = 0.20), when compared with unifocal (median 60 days; IQR 30-90), small lesions (diameter =15 cm) (median 60 days; IQR 30-90), or WHO category 1+2 lesions (median 60 days; IQR 30-90), respectively. Our results demonstrate that after an initial period of progression towards ulceration or bone involvement, BU lesions become stable regarding size and focal/multi-focal progression. Therefore, in future studies on BU epidemiology, severe clinical forms should be systematically considered as distinct phenotypes of the same disease and thus subjected to specific risk factor investigation.