998 results for VOICE QUALITY
Abstract:
Skype is one of the best-known applications that has guided the evolution of real-time video streaming, and it has become one of the most widely used pieces of software in everyday life. It provides VoIP audio/video calls as well as chat messaging and file transfer. Versions are available for all the principal operating systems, such as Windows, Macintosh and Linux, as well as for mobile systems. Voice quality has underpinned Skype's success since its launch in 2003, and its peer-to-peer architecture has allowed worldwide diffusion. After the introduction of video calls in 2006, Skype became a complete solution for communication between two or more people. As primarily a video conferencing application, Skype assumes certain characteristics of the delivered video in order to optimize its perceived quality. However, in recent years, and with the recent release of SkypeKit, many new Skype video-enabled devices have come out, especially in the mobile world. This has forced a change to the traditional recording, streaming and receiving settings, allowing for a wide range of network and content dynamics. Video calls are no longer based on static ‘chatting’: mobile devices have opened new possibilities and can be used in several scenarios. For instance, lecture streaming or one-to-one mobile video conferences exhibit more dynamics, as both caller and callee might be on the move. Most of these cases differ from “head-and-shoulders”-only content. Therefore, Skype needs to optimize its video streaming engine to cover more video types. Heterogeneous connections require different behaviors and solutions, and Skype must cope with this variety to maintain a certain quality independently of the connection used. Part of the present work will focus on analyzing Skype behavior depending on video content. Since the Skype protocol is proprietary, most studies so far have tried to characterize its traffic and to reverse engineer its protocol.
However, questions related to the behavior of Skype, especially regarding quality as perceived by users, remain unanswered. We will study the capabilities of Skype's video codecs and video quality assessment. Another motivation of our work is the design of a mechanism that estimates the perceived cost of network conditions on Skype video delivery. To this end, we will try to assess objectively the impact of network impairments on the perceived quality of a Skype video call. Traditional video streaming schemes lack the flexibility and adaptivity that Skype tries to achieve at the edge of a network. Our contribution will lie in a testbed and a consequent objective video quality analysis carried out on input videos. We will stream raw video files with Skype over an impaired channel, record them at the receiver side, and analyze them with objective quality of experience metrics.
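The abstract does not name the objective metrics it uses. As an illustration only, the sketch below computes peak signal-to-noise ratio (PSNR), one commonly used full-reference metric, between a sent frame and the frame recorded at the receiver. The function names, the flat-list frame representation and the sample values are assumptions made for this example, not details from the work.

```python
import math


def mse(reference, received):
    """Mean squared error between two equally sized frames.

    Frames are flat lists of 8-bit luma samples (an assumed representation).
    """
    if len(reference) != len(received) or not reference:
        raise ValueError("frames must be non-empty and the same size")
    return sum((a - b) ** 2 for a, b in zip(reference, received)) / len(reference)


def psnr(reference, received, peak=255):
    """PSNR in dB; returns infinity when the two frames are identical."""
    err = mse(reference, received)
    if err == 0:
        return math.inf
    return 10 * math.log10(peak ** 2 / err)


# Hypothetical sample values for one tiny frame, before and after transmission.
sent = [52, 55, 61, 66, 70, 61, 64, 73]
recv = [54, 55, 60, 66, 69, 63, 64, 70]
print(f"PSNR: {psnr(sent, recv):.2f} dB")
```

In a testbed of the kind described, a score like this would be computed per frame and then aggregated over the call, after aligning the recorded frames with the source frames.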
Abstract:
A case study of vocal fold paralysis treatment is described with the help of the voice quality analysis application BioMet®Phon. The case is that of a 40-year-old female patient who was diagnosed with vocal fold paralysis following a cardiopulmonary intervention that required intubation for 8 days and a subsequent tracheotomy for 15 days. The patient presented breathy and asthenic phonation, and dysphagia. Six main examinations were conducted over the full year that the treatment lasted, consisting of periodic reviews including video-endostroboscopy, voice analysis and breathing function monitoring. The phoniatric treatment included 20 sessions of vocal rehabilitation, followed by an intracordal infiltration with Radiesse 8 months after the rehabilitation treatment started, followed by 6 more sessions of rehabilitation. The videoendoscopy and the voice quality analysis show a substantial improvement in vocal function, with recovery in all the measures estimated (jitter, shimmer, mucosal wave contents, glottal closure, harmonic contents and biomechanical function analysis). The paper reports the procedure followed and the results obtained by comparing the longitudinal progression of the treatment, illustrating the utility of voice quality analysis tools in speech therapy.
Abstract:
BioMet®Phon is a software application developed for the characterization of voice in voice quality evaluation. Initially it was conceived as plain research code to estimate the glottal source from voice and obtain the biomechanical parameters of the vocal folds from the spectral density of the estimate. This code grew into what is now the Glottex®Engine package (G®E). Further demands from users in the laryngology and speech therapy fields prompted the development of a specific Graphical User Interface (GUI) to encapsulate user interaction with the G®E. This gave rise to BioMet®Phon, an application which extracts the glottal source from voice and offers a complete parameterization of this signal, including distortion, cepstral, spectral, biomechanical, time domain, contact and tremor parameters. The semantic capabilities of biomechanical parameters are discussed. Case studies from its application to the fields of laryngology and speech therapy are given and discussed. Validation results in voice pathology detection are also presented. Applications to laryngology, speech therapy, and monitoring neurological deterioration in the elderly are proposed.
Abstract:
The aim of the study was, first, to document the acoustic parameters of voice using the Multidimensional Voice Program (MDVP, Kay Elemetrics) in a group of children with dysarthria subsequent to treatment for cerebellar tumour (CT) and, second, to compare the acoustic findings with perceptual voice characteristics as described by the GIRBAS (grade, instability, roughness, breathiness, asthenicity, strain). The assessments were performed on 29 voice samples: 9 cerebellar tumour participants with dysarthria and 20 control participants. None of the control voices were rated as exhibiting any of the six parameters described by the GIRBAS, while 7 of the CT participants were noted to have at least a mild voice disorder. Roughness, instability, breathiness and asthenicity were all identified as voice characteristics in the CT voice samples. Acoustically, the CT voice samples differed significantly from the controls' voices on frequency and amplitude perturbation measures. Our findings confirmed voice dysfunction as a component of dysarthria in children treated for cerebellar tumour; the links between acoustic and perceptual descriptions are discussed. Copyright (C) 2004 S. Karger AG, Basel.
Evaluation of oral-motor movements and speech in patients with tetanus of a public service in Brazil
Abstract:
The oral-motor movements and speech of patients with tetanus were investigated to determine the existence of possible signs that are characteristic of this pathology. Thirteen patients clinically diagnosed with tetanus (10 with severe tetanus and three with very severe tetanus) and admitted to an intensive care unit underwent clinical evaluation of oral-motor movements and speech. Statistical analysis indicated significant between-group differences for speech motor functions, suggesting that individuals with very severe tetanus present rigidity as a characteristic interfering with articulatory precision (P = 0.035) and movement rate (P = 0.038). For lip closure, tongue movement, palatal elevation, gag reflex and voice quality, no between-group differences were identified for the specific abnormal characteristics. The observed abnormal results indicate that the muscle strength and functional status of the oral-motor system presented by most of the participants of the study did not ensure the necessary integrity for satisfactory performance. The characterisation of the oral myofunctional aspects of patients with tetanus provides medical teams, patients and families with a wider and better description of the clinical situation, supporting diagnosis, prognosis and treatment.
Abstract:
Objectives: Injectable corticosteroids have been used in phonosurgery to prevent scarring of the vocal fold because of their effects on wound healing, and to ensure better voice quality. We histologically evaluated the effects of dexamethasone sodium phosphate infiltration on acute vocal fold wound healing in rabbits 3 and 7 days after surgically induced injury by quantification of the inflammatory reaction and collagen deposition. Methods: A standardized surgical incision was made in the vocal folds of 12 rabbits, and 0.1 mL dexamethasone sodium phosphate (4 mg/mL) was injected into the left vocal fold. The right vocal fold was not injected and served as the control. The larynges were collected 3 and 7 days after surgery. For histologic analysis, the vocal folds were stained with hematoxylin-eosin for quantification of the inflammatory response and with picrosirius red for quantification of collagen deposition. Results: There was no quantitative difference in the inflammatory response between vocal folds injected with the corticosteroid and control vocal folds. However, the rate of collagen deposition was significantly lower in the corticosteroid-treated group at 3 and 7 days after injury (p = 0.002). Conclusions: The present results suggest that dexamethasone reduces collagen deposition during acute vocal fold wound healing.
Abstract:
The objective of the study was to compare the jitter and shimmer values of the spoken voice among women in menacme and menopausal women with or without hormonal replacement therapy (HRT). Forty-five women were studied, divided into the following groups: Control Group (CG), 15 women aged 20-40 years with regular menstrual cycles who did not take hormonal contraceptives; Treated Group (TG), 15 women aged 45-60 years with at least 2 years of menopause, under continuous HRT with 1 mg estradiol valerate + 90 µg norgestimate per day for at least 6 months; Untreated Group (UG), 15 women aged 45-60 years with at least 2 years of menopause who did not use HRT. Mean age was 30.3, 54.5, and 56.5 years for CG, TG, and UG, respectively. All subjects underwent acoustic analysis of jitter and shimmer for the sustained vowels /e/ and /i/. Mean jitter values were 0.56%, 0.64%, and 0.56% for the vowel /e/ and 0.88%, 0.79%, and 0.68% for the vowel /i/ for CG, TG, and UG, respectively. Mean shimmer values were 4.17%, 4.38%, and 4.77% for the vowel /e/ and 5.19%, 4.59%, and 5.37% for the vowel /i/ for CG, TG, and UG, respectively. There were no significant differences between the groups studied. The results obtained with the methodology used suggest that there were no significant differences in jitter and shimmer for the sustained vowels /i/ and /e/ between menopausal women with or without HRT, or between young women and menopausal women, treated or not.
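The abstract reports jitter and shimmer as percentages but does not say which variant of each measure was used. As a minimal sketch, assuming the common "local" definition (mean absolute difference between consecutive cycles, divided by the mean value, times 100), both can be computed with the same helper. The function name and the sample cycle values are hypothetical.

```python
def local_perturbation_percent(values):
    """Mean absolute cycle-to-cycle difference relative to the mean value, in percent.

    Applied to glottal cycle periods this gives local jitter; applied to
    cycle peak amplitudes it gives local shimmer.
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100 * mean_diff / mean_value


# Hypothetical measurements from a sustained vowel (not data from the study).
periods_ms = [5.00, 5.04, 4.98, 5.02, 4.96]   # duration of each glottal cycle
amplitudes = [0.80, 0.84, 0.78, 0.82, 0.76]   # peak amplitude of each cycle

print(f"jitter:  {local_perturbation_percent(periods_ms):.2f}%")
print(f"shimmer: {local_perturbation_percent(amplitudes):.2f}%")
```

Real recordings first need cycle boundaries extracted from the waveform, which is the harder step and is not shown here.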
Abstract:
OBJECTIVE To compare the effectiveness of two speech therapy interventions, vocal warm-up and breathing training, focusing on teachers’ voice quality. METHODS A single-blind, randomized, parallel clinical trial was conducted. The study included 31 teachers aged 20 to 60 years from a public school in Salvador, BA, Northeastern Brazil, with minimum workloads of 20 hours a week, with or without self-reported vocal alterations. The exclusion criteria were the following: being a smoker, excessive alcohol consumption, receiving additional speech therapy assistance while taking part in the study, being affected by upper respiratory tract infections, professional use of the voice in another activity, neurological disorders, and history of cardiopulmonary pathologies. The subjects were allocated by simple randomization to the vocal warm-up (n = 14) and breathing training (n = 17) groups. The teachers’ voice quality was subjectively evaluated through the Voice Handicap Index (Índice de Desvantagem Vocal, in the Brazilian version) and computerized voice analysis (average fundamental frequency, jitter, shimmer, noise, and glottal-to-noise excitation ratio) by speech therapists. RESULTS Before the interventions, the groups were similar regarding sociodemographic characteristics, teaching activities, and vocal quality. The variations before and after the intervention in self-assessment and acoustic voice indicators did not differ significantly between the groups. In the comparison between groups before and after the six-week interventions, significant reductions in the Voice Handicap Index of subjects in both groups were observed, as well as reduced average fundamental frequencies in the vocal warm-up group and increased shimmer in the breathing training group.
Subjects from the vocal warm-up group reported speaking more easily and a greater general improvement in their voices compared with the breathing training group. CONCLUSIONS Both interventions were similar regarding their effects on the teachers’ voice quality. However, each intervention individually contributed to improving the teachers’ voice quality, especially the vocal warm-up. TRIAL RECORD NCT02102399, “Vocal Warm-up and Respiratory Muscle Training in Teachers”.
Partial cricotracheal resection for pediatric subglottic stenosis: long-term outcome in 57 patients.
Abstract:
OBJECTIVE: We sought to assess the long-term outcome of 57 pediatric patients who underwent partial cricotracheal resection for subglottic stenosis. METHODS: Eighty-one pediatric partial cricotracheal resections were performed in our tertiary care institution between 1978 and 2004. Fifty-seven patients had a minimal follow-up time of 1 year and were included in this study. Evaluation was based on the last laryngotracheal endoscopy, the responses to a questionnaire, and a retrospective review of the patient's data. The following parameters were analyzed: decannulation rates, breathing, voice quality, and deglutition. RESULTS: A single-stage partial cricotracheal resection was performed in 38 patients, and a double-stage procedure was performed in 19 patients. Sixteen patients underwent an extended partial cricotracheal resection (ie, partial cricotracheal resection combined with another open procedure). At a median follow-up time of 5.1 years, the decannulation rates after a single- or double-stage procedure were 97.4% and 95%, respectively. Two patients remained tracheotomy dependent. One patient had moderate exertional dyspnea, and all other patients had no exertional dyspnea. Voice quality was found to improve after surgical intervention for 1 +/- 1.34 grade dysphonia (P < .0001) according to the adapted GRBAS grading system (Grade, Roughness, Breathiness, Asthenia, and Strain). CONCLUSIONS: Partial cricotracheal resection provides good results for grades III and IV subglottic stenosis as primary or salvage operations. The procedure has no deleterious effects on laryngeal growth and function. The quality of voice significantly improves after surgical intervention but largely depends on the preoperative condition.
Abstract:
Medialization laryngoplasty was performed in 25 patients between 1993 and 1997. The underlying pathology resulting in glottal incompetence was vocal cord paralysis in 22 patients and vocal cord bowing in 3 patients. Two types of implants were used: self-carved Proplast in 19 patients and prefabricated hydroxyapatite prostheses in 6 patients. Preoperative and postoperative results were compared in terms of dysphagia, vocal quality as graded by three experienced voice specialists, and computer measurements of the glottal gap. All patients showed improvement both subjectively and on the objective measurements used. Swallowing returned to normal in all patients who had isolated recurrent laryngeal nerve paralysis. The voice improved in all patients but was rarely judged as entirely normal.
Abstract:
Velopharyngeal insufficiency (VPI) is a structural or functional disorder that causes hypernasal speech. Velopharyngeal flaps, speech therapy and augmentation pharyngoplasty using different implants have all been used to address this disorder. We present our results following rhinopharyngeal autologous fat injection in 18 patients with mild velopharyngeal insufficiency (12 soft palate clefts, 4 functional VPI, 2 myopathy). Twenty-eight injections were carried out between 2004 and 2007. The degree of hypernasal speech was evaluated pre- and postoperatively by a speech therapist and an ENT specialist and quantified by acoustic nasometry (Kay Elemetrics). All patients had been exhaustively treated with preoperative speech therapy (average, 8 years). The mean value of the nasalance score was 37% preoperatively and 23% postoperatively (p = 0.015). The hypernasality was reduced postoperatively in all patients (1-3 degrees of the Borel-Maisonny score). There were no major complications and two minor complications (one hematoma, one cervical pain). Autologous fat injection is a simple, safe, minimally invasive procedure. It proves to be efficient in cases of mild velopharyngeal insufficiency or after a suboptimal velopharyngoplasty.
Abstract:
OBJECTIVES: To delineate the various factors contributing to failure or delay in decannulation after partial cricotracheal resection (PCTR) in children. STUDY DESIGN: Case series. SETTING: Academic tertiary medical center. SUBJECTS AND METHODS: A retrospective case review of 100 children who underwent PCTR between 1978 and 2008 for severe subglottic stenosis using an ongoing database. RESULTS: Ninety of 100 (90%) patients were decannulated. Six patients needed secondary tracheostomy. The results of the preoperative evaluation showed grade II stenosis in four patients, grade III in 64 patients, and grade IV in 32 patients. The overall decannulation rate was 100 percent in grade II, 95 percent in grade III, and 78 percent in grade IV stenosis. Fourteen (14%) patients required revision open surgery. The most common cause of revision surgery was posterior glottic stenosis. Partial anastomotic dehiscence was seen in four patients. Delayed decannulation (>1 year) occurred in nine patients. Overall mortality rate in the whole series was 6 percent. No deaths were directly related to the surgery. No iatrogenic recurrent laryngeal nerve injury was present in the entire series. CONCLUSION: Comorbidities and associated syndromes should be addressed before PCTR is planned to improve the final postoperative outcome in terms of decannulation. Perioperative morbidity due to anastomotic dehiscence, to a certain extent, can be avoided by intraoperative judgment in the selection of double-stage surgery when more than five tracheal rings need to be resected. Subglottic stenosis with glottic involvement continues to pose a difficult challenge to pediatric otolaryngologists, often necessitating revision procedures.
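The per-grade decannulation rates reported above can be checked against the overall figure with a short calculation. The abstract gives only rounded percentages per grade, so the per-grade patient counts below are back-calculated assumptions, not reported values.

```python
# Patients per stenosis grade and the decannulation rate reported for each grade.
grades = {
    "II": (4, 1.00),
    "III": (64, 0.95),
    "IV": (32, 0.78),
}

# Back-calculate whole-patient counts from the rounded percentages (an assumption).
decannulated = {grade: round(n * rate) for grade, (n, rate) in grades.items()}

total_patients = sum(n for n, _ in grades.values())
total_decannulated = sum(decannulated.values())
overall_rate = total_decannulated / total_patients

print(decannulated)
print(f"{total_decannulated} of {total_patients} decannulated ({overall_rate:.0%})")
```

Under this rounding the three grades add up to the 90 of 100 patients stated in the results, so the per-grade and overall figures are mutually consistent.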
Abstract:
The purpose of the research project The poetics of the talking book is to contribute to the knowledge about patterns of understanding in young adults’ reception of fiction, which they listened to through audio books. The problem explored was: How do different groups of listeners receive fictive text presented as a talking book with variations regarding use of voice, engagement and sound effects? The problem formulation rendered four specific research questions: 1. What patterns can be identified in the listeners’ answers regarding story structure and cognitive content in a comparative perspective comprising different reading styles in the taped versions of the text? 2. What patterns of understanding in interpretative reading can be identified in different listeners? 3. Which thoughts do the listeners have about what the talking book should sound like? 4. What affordances for young adults with the functional disability of mild mental retardation can be made visible through guided literature conversations? The theoretical frame of reference was formed by text–reader-oriented literary theory, psychological schema theory, and research regarding voice quality and communication. The project was carried out in two steps. The first phase was to produce the audio books with two variations of reading practice of three short stories with an existential theme in each text. The second step comprised interviewing of 32 young adults (a special group with a reading handicap in form of mild mental retardation, and a reference group with no handicap). The interviews formed as literary conversation were carried out three times during one year. The phenomenological-hermeneutic approach focused on the life worlds of the participants as meaning seeking beings. The analysis was carried out using method triangulation, mainly using phenomenological meaning concentration. 
The double hermeneutics in use when interpreting the interpretations of the participants revealed a capacity for aesthetic reading of fiction in the special group as well as in the reference group. The aesthetic qualities were found sufficient in all variations of reading by the professional readers of the audio book they listened to. The young adults also could describe how they wanted the audio book to sound: just as if you were reading yourself. A model describing the analytical steps and concepts in use was a result that can serve as an outline of a poetics for the talking book. Unexpected research results were how important the guided literary conversation turned out to be in order to realise the affordances given by the texts regarding exploration of existential themes in the young adults’ life worlds. Thus the result of the research project can be positioned as a piece of emancipatory research stressing the importance of including this group of young adults in the society’s conversation about culture and meaning.
Abstract:
The media tends to represent female athletes as women first and athletes second (Koivula, 1999). The present study investigated whether this same trend was present for female sportscasters, using a self-presentational framework. Self-presentation is the process by which people try to control how others see them (Leary, 1995). One factor that may influence the type of image they try to project is their roles held in society, including gender roles. The gender roles for a man include dominance, assertiveness, and masculinity, while the gender roles for a woman include nurturer, femininity, and attractiveness (Deaux & Major, 1987). By contrast, sports broadcasters are expected to be knowledgeable, assertive, and competent. Research suggests that female sports broadcasters are seen as less competent and less persuasive than male sports broadcasters (Mitrook & Dorr, 2001; Ordman & Zillmann, 1994; Toro, 2005). One reason for this difference may be that the gender roles for a man are much more similar to those of a sportscaster, compared to those of a woman. Thus, there may be a conflict between the two roles for women. The present study investigated whether the gender and perceived attractiveness of sportscasters influenced the audience's perceptions of the level of competence that a sportscaster demonstrates. Two hundred and four male (n = 75) and female (n = 129) undergraduate students were recruited from a southern Ontario university to participate in the study. The average age of the male participants was 21.23 years (SD = 1.60), and the average age for female participants was 20.67 years (SD = 1.31). The age range for all participants was from 19 to 30 years (M = 20.87 years, SD = 1.45). After providing informed consent, participants randomly received one of four possible questionnaire packages. The participants answered the demographic questionnaire, and then proceeded to view the picture and read the script of a sports newscast.
Next, based on the picture and script, the participants answered the competence questionnaire, assessing the general, sport specific, and overall competence of the sportscaster. Once participants had finished, they returned the package to the researcher and were thanked for their time. Data were analyzed using an ANOVA to determine if general sport competence differs with respect to gender and attractiveness of the sportscaster. Overall, the ANOVA was non-significant (p > .05), indicating no differences on the dependent variable based on gender (F(3, 194) = .631, p = .426), attractiveness (F(3, 194) = .070, p = .791), or the interaction of the two (F(3, 194) = .043, p = .836). Although none of the study hypotheses were supported, the study provided some insight into the perceived competence of female sportscasters. It is possible that female sportscasters are now seen as competent in the area of sports. Sample characteristics could also have influenced these results; the participants in the current study were primarily physical education and kinesiology students, who had experience participating in physical activity with both men and women. Future research should investigate this issue further by using a video sportscast. It is possible that delivery characteristics such as voice quality or eye contact may also impact perceptions of sportscasters.
Abstract:
Parkinson's disease (PD) is a degenerative illness whose cardinal symptoms include rigidity, tremor, and slowness of movement. In addition to its widely recognized effects, PD can have a profound effect on speech and voice. The speech symptoms most commonly demonstrated by patients with PD are reduced vocal loudness, monopitch, disruptions of voice quality, and an abnormally fast rate of speech. This cluster of speech symptoms is often termed hypokinetic dysarthria. The disease can be difficult to diagnose accurately, especially in its early stages. For this reason, automatic techniques based on Artificial Intelligence should increase diagnostic accuracy and help doctors make better decisions. The aim of the thesis work is to predict PD based on audio files collected from various patients. The audio files are preprocessed in order to extract the features. The preprocessed data contain 23 attributes and 195 instances. On average there are six voice recordings per person. Using a data compression technique such as the Discrete Cosine Transform (DCT), the number of instances can be reduced. After data compression, attribute selection is done using several WEKA built-in methods such as ChiSquared, GainRatio and InfoGain. After identifying the important attributes, we evaluate the attributes one by one using stepwise regression. Based on the selected attributes, we process the data in WEKA using a cost-sensitive classifier with various algorithms such as MultiPass LVQ, Logistic Model Tree (LMT) and K-Star. The classification results show an accuracy of 80% on average. Using these features, approximately 95% classification accuracy for PD is achieved. This shows that, using the audio dataset, PD could be predicted with a higher level of accuracy.