904 results for audio-visual automatic speech recognition


Relevance: 30.00%

Abstract:

The aim of this thesis is to investigate computerized voice assessment methods for distinguishing between normal and dysarthric speech signals. The proposed system combines signal processing and artificial intelligence techniques. Each subject read the sentences used for the measurement of inter-stress intervals (ISI), and these sentences were analyzed to compare normal and impaired voice. A band-pass filter was used to preprocess the speech samples. Speech segmentation was performed using signal energy and spectral centroid to separate voiced and unvoiced regions of the speech signal. Acoustic features were extracted from the LPC model and from the speech segments of each audio signal to find anomalies. The speech features assessed for classification were energy entropy, zero-crossing rate (ZCR), spectral centroid, mean fundamental frequency (Meanf0), jitter (RAP), jitter (PPQ), and shimmer (APQ). Naïve Bayes (NB) was used for speech classification. For speech test-1 and test-2, NB achieved classification accuracies of 72% and 80%, respectively, between healthy and impaired speech samples. For speech test-3, NB achieved 64% correct classification. The results point to the possibility of classifying speech impairment in PD patients based on the clinical rating scale.
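Two of the features named in this abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulas used, so the definitions below are assumptions: zero-crossing rate as the fraction of adjacent sample pairs that change sign, and jitter (RAP) as the commonly used relative average perturbation over three consecutive pitch periods. The function names are hypothetical.

```python
def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (assumed definition)."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def jitter_rap(periods):
    """Relative average perturbation of a sequence of pitch periods.

    Each interior period is compared with the mean of itself and its two
    neighbours; the mean absolute deviation is divided by the mean period.
    """
    if len(periods) < 3:
        raise ValueError("need at least three pitch periods")
    deviations = [
        abs(periods[i] - (periods[i - 1] + periods[i] + periods[i + 1]) / 3)
        for i in range(1, len(periods) - 1)
    ]
    mean_deviation = sum(deviations) / len(deviations)
    mean_period = sum(periods) / len(periods)
    return mean_deviation / mean_period
```

A perfectly periodic voice gives a jitter of zero under this definition, and larger cycle-to-cycle variation gives larger values.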

Relevance: 30.00%

Abstract:

This paper presents the development and evaluation of a method for enabling quantitative and automatic scoring of alternating tapping performance of patients with Parkinson’s disease (PD). Ten healthy elderly subjects and 95 patients in different clinical stages of PD used a touch-pad handheld computer to perform alternate tapping tests in their home environments. First, a neurologist used a web-based system to visually assess impairments in four tapping dimensions (‘speed’, ‘accuracy’, ‘fatigue’ and ‘arrhythmia’) and a global tapping severity (GTS). Second, tapping signals were processed with time series analysis and statistical methods to derive 24 quantitative parameters. Third, principal component analysis was used to reduce the dimensions of these parameters and to obtain scores for the four dimensions. Finally, a logistic regression classifier was trained using 10-fold stratified cross-validation to map the reduced parameters to the corresponding visually assessed GTS scores. Results showed that the computed scores correlated well with visually assessed scores and were significantly different across Unified Parkinson’s Disease Rating Scale scores of upper limb motor performance. In addition, they had good internal consistency, good ability to discriminate between healthy elderly subjects and patients in different disease stages, and good sensitivity to treatment interventions, and they could reflect the natural disease progression over time. In conclusion, the automatic method can be useful for objectively assessing the tapping performance of PD patients and can be included in telemedicine tools for remote monitoring of tapping.
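The stratified 10-fold cross-validation mentioned above can be sketched as follows. This is only an illustration of the splitting idea, not the authors' implementation: the helper names are hypothetical, and the labels in the usage note are stand-ins (group membership rather than the GTS scores the classifier was actually trained on).

```python
import random
from collections import defaultdict


def stratified_folds(labels, n_folds=10, seed=0):
    """Assign sample indices to folds so each fold has similar class ratios."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    folds = [[] for _ in range(n_folds)]
    offset = 0
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Deal indices round-robin, continuing where the previous class stopped.
        for position, index in enumerate(indices):
            folds[(offset + position) % n_folds].append(index)
        offset += len(indices)
    return folds


def train_test_splits(labels, n_folds=10, seed=0):
    """Yield (train_indices, test_indices), holding out one fold at a time."""
    folds = stratified_folds(labels, n_folds, seed)
    for held_out in range(n_folds):
        test = sorted(folds[held_out])
        train = sorted(
            i for f, fold in enumerate(folds) if f != held_out for i in fold
        )
        yield train, test
```

With the group sizes from the abstract (10 healthy subjects, 95 patients) encoded as `[0] * 10 + [1] * 95`, every fold receives exactly one healthy subject, which plain random splitting would not guarantee.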

Relevance: 30.00%

Abstract:

Objective: To develop a method for objective quantification of PD motor symptoms related to Off episodes and peak-dose dyskinesias, using spiral data gathered with a touch screen telemetry device. The aim was to objectively characterize predominant motor phenotypes (bradykinesia and dyskinesia) to help automate the visual interpretation of movement anomalies in spirals as rated by movement disorder specialists. Background: A retrospective analysis was conducted on recordings from 65 patients with advanced idiopathic PD from nine different clinics in Sweden, recruited from January 2006 until August 2010. In addition to the patient group, 10 healthy elderly subjects were recruited. Upper limb movement data were collected using a touch screen telemetry device in the subjects' home environments. Measurements with the device were performed four times per day during week-long test periods. On each test occasion, the subjects were asked to trace pre-drawn Archimedean spirals, shown on the screen of the device, using the dominant hand. The spiral test was repeated three times per test occasion, and the subjects were instructed to complete it within 10 seconds. The device had a sampling rate of 10 Hz and recorded both the position and the time-stamps (in milliseconds) of the pen tip. Methods: Four independent raters (FB, DH, AJ and DN) used a web interface that animated the spiral drawings and allowed them to observe different kinematic features during the drawing process and to rate task performance. Initially, a number of kinematic features were assessed, including ‘impairment’, ‘speed’, ‘irregularity’ and ‘hesitation’, followed by marking the predominant motor phenotype on a 3-category scale: tremor, bradykinesia and/or choreatic dyskinesia. There were only 2 test occasions that all four raters either classified as tremor or for which they could not identify the motor phenotype.
Therefore, the two main motor phenotype categories were bradykinesia and dyskinesia. ‘Impairment’ was rated on a scale from 0 (no impairment) to 10 (extremely severe), whereas ‘speed’, ‘irregularity’ and ‘hesitation’ were rated on a scale from 0 (normal) to 4 (extremely severe). The proposed data-driven method consisted of the following steps. Initially, 28 spatiotemporal features were extracted from the time series signals before being presented to a Multilayer Perceptron (MLP) classifier. The features were based on different kinematic quantities of spirals, including radius, angle, speed and velocity, with the aim of measuring the severity of involuntary symptoms and discriminating between PD-specific (bradykinesia) and/or treatment-induced (dyskinesia) symptoms. A Principal Component Analysis was applied to the features to reduce their dimensionality; 4 relevant principal components (PCs) were retained and used as inputs to the MLP classifier. Finally, the MLP classifier mapped these components to the corresponding visually assessed motor phenotype scores, automating the scoring of bradykinesia and dyskinesia in PD patients while they draw spirals using the touch screen device. For motor phenotype (bradykinesia vs. dyskinesia) classification, stratified 10-fold cross-validation was employed. Results: There was good agreement between the four raters when rating the individual kinematic features, with intra-class correlation coefficients (ICC) of 0.88 for ‘impairment’, 0.74 for ‘speed’ and 0.70 for ‘irregularity’, and moderate agreement when rating ‘hesitation’, with an ICC of 0.49. When assessing the two main motor phenotype categories (bradykinesia or dyskinesia) in animated spirals, the agreement between the four raters ranged from fair to moderate. There were good correlations between the mean ratings of the four raters on individual kinematic features and the computed scores.
The MLP classifier classified the motor phenotype (bradykinesia or dyskinesia) with an accuracy of 85% in relation to the visual classifications of the four movement disorder specialists. The test-retest reliability of the four PCs across the three spiral test trials was good, with Cronbach’s Alpha coefficients of 0.80, 0.82, 0.54 and 0.49, respectively. These results indicate that the computed scores are stable and consistent over time. Significant differences were found between the two groups (patients and healthy elderly subjects) in all the PCs except PC3. Conclusions: The proposed method automatically assessed the severity of unwanted symptoms and could discriminate reasonably well between PD-specific and/or treatment-induced motor symptoms, in relation to the visual assessments of movement disorder specialists. The objective assessments could provide a time-effect summary score that could be useful for improving decision-making during symptom evaluation of individualized treatment when the goal is to maximize functional On time for patients while minimizing their Off episodes and troublesome dyskinesias.
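The test-retest figures above are Cronbach's alpha coefficients. As a minimal sketch, assuming the standard formula α = k/(k−1) · (1 − Σ item variances / variance of totals), with each of the k spiral trials treated as an "item" and one score per subject per trial (the function name and data layout are hypothetical, not taken from the study):

```python
from statistics import variance


def cronbach_alpha(trials):
    """Cronbach's alpha for k trials, each a list with one score per subject."""
    k = len(trials)
    if k < 2:
        raise ValueError("need at least two trials")
    n_subjects = len(trials[0])
    if any(len(trial) != n_subjects for trial in trials):
        raise ValueError("every trial needs one score per subject")

    # Sum of the per-trial sample variances.
    item_variance_sum = sum(variance(trial) for trial in trials)
    # Variance of each subject's total score across trials.
    totals = [sum(scores) for scores in zip(*trials)]
    total_variance = variance(totals)
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - item_variance_sum / total_variance)
```

Alpha approaches 1 when subjects keep the same relative scores across trials and falls as the trials disagree.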

Relevance: 30.00%

Abstract:

A challenge for the clinical management of advanced Parkinson’s disease (PD) patients is the emergence of fluctuations in motor performance, which represents a significant source of disability during the patients' activities of daily living. There is a lack of objective measurement of treatment effects for in-clinic and at-home use that can provide an overview of the treatment response. The objective of this paper was to develop a method for objective quantification of advanced PD motor symptoms related to off episodes and peak-dose dyskinesia, using spiral data gathered by a touch screen telemetry device. More specifically, the aim was to objectively characterize motor symptoms (bradykinesia and dyskinesia) to help automate the visual interpretation of movement anomalies in spirals as rated by movement disorder specialists. Digitized upper limb movement data of 65 advanced PD patients and 10 healthy elderly (HE) subjects were recorded as they performed spiral drawing tasks on a touch screen device in their home environments. Several spatiotemporal features were extracted from the time series and used as inputs to machine learning methods. The methods were validated against ratings on animated spirals scored by four movement disorder specialists who visually assessed a set of kinematic features and the motor symptom. The ability of the method to discriminate between PD patients and HE subjects and the test-retest reliability of the computed scores were also evaluated. Computed scores correlated well with mean visual ratings of individual kinematic features. The best performing classifier (Multilayer Perceptron) classified the motor symptom (bradykinesia or dyskinesia) with an accuracy of 84% and an area under the receiver operating characteristic curve of 0.86 in relation to the visual classifications of the raters.
In addition, the method provided high discriminating power when distinguishing between PD patients and HE subjects and had good test-retest reliability. This study demonstrated the potential of using digital spiral analysis for objective quantification of PD-specific and/or treatment-induced motor symptoms.
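The abstract does not list the spatiotemporal features. As a rough sketch of the kind of raw quantities such features could be built on, assuming each sample is a pen position with a millisecond timestamp and that the spiral centre is known (the function name, data layout and centre argument are all hypothetical):

```python
import math


def spiral_kinematics(samples, center=(0.0, 0.0)):
    """Return per-sample radius and angle, and per-step drawing speed.

    samples: list of (x, y, t_ms) tuples ordered by time.
    """
    cx, cy = center
    radii = [math.hypot(x - cx, y - cy) for x, y, _ in samples]
    angles = [math.atan2(y - cy, x - cx) for x, y, _ in samples]

    speeds = []
    for (x0, y0, t0), (x1, y1, t1) in zip(samples, samples[1:]):
        dt = (t1 - t0) / 1000.0  # milliseconds to seconds
        if dt <= 0:
            raise ValueError("timestamps must be strictly increasing")
        speeds.append(math.hypot(x1 - x0, y1 - y0) / dt)
    return radii, angles, speeds
```

Summary statistics of these series (for example the mean or variability of the speed) would then be candidate inputs to a classifier; which statistics the study actually used is not stated here.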

Relevance: 30.00%

Abstract:

The results of the analyses performed on these data indicated significant differences in the increase in the amplitude of the nasal horizontal meridian of the monocular visual field, measured in angular units. The differences were interpreted as indicating the influence of the three different levels of complexity of the visual stimuli. It was therefore concluded that the collative variable of complexity influences the perceptual act of visual recognition.

Relevance: 30.00%

Abstract:

Ornamental fish may be severely affected by a stressful environment. Stressors impair the immune response, reproduction and growth rate; thus, identifying possible stressors will help improve the overall quality of ornamental fish. The aim of this study was to determine the whole-body cortisol of adult zebrafish, Danio rerio, following visual or direct contact with a predator species. Zebrafish were distributed in three groups: the first group, which consisted of zebrafish reared completely isolated from the predator, was considered the negative control; the second group, in which the predator, Parachromis managuensis, was stocked together with zebrafish, was considered the positive control; the third group consisted of zebrafish stocked in a glass aquarium, with direct visual contact with the predator. The mean whole-body cortisol concentration in zebrafish from the negative control was 6.78 ± 1.12 ng/g, a concentration statistically lower than that found in zebrafish having visual contact with the predator (9.26 ± 0.88 ng/g), which, in turn, was statistically lower than the mean whole-body cortisol of the positive control group (12.35 ± 1.59 ng/g). The higher whole-body cortisol concentration found in fish from the positive control can be attributed to the detection, by the zebrafish, of relevant risk situations that may involve a combination of chemical, olfactory and visual cues. One of the functions of elevated cortisol is to mobilize energy from body resources to cope with stress. The elevation of whole-body cortisol in fish subjected to visual contact with the predator involves only the visual cue in the recognition of predation risk. We hypothesized that the zebrafish could recognize predator characteristics in P. managuensis, such as length, shape, color and behavior. The elevation of whole-body cortisol in zebrafish suggested that visual contact with the predator may elicit a stress response in prey fish.
This assertion has a strong practical application concerning the species distribution in ornamental fish markets in which prey species should not be allowed to see predator species. Minimizing visual contact between prey and predator fish may improve the quality, viability and welfare of small fish in ornamental fish markets. (c) 2007 Elsevier B.V. All rights reserved.

Relevance: 30.00%

Abstract:

This work uses feature-based computer vision algorithms to identify medicine boxes for the visually impaired. The system is intended for people whose vision is compromised by disease, which hinders identification of the correct medicine to be taken. We use the camera available in several popular devices, such as computers, televisions and phones, to identify the box of the correct medicine from its image and to play audio that gives the user information about the medication, such as the dosage, indications and contraindications. We use an object detection model with algorithms that identify features on the medicine boxes and play the audio when those features are detected. In experiments carried out with 15 people, 93% considered the system useful and very helpful for identifying medicines from their boxes. This technology should therefore be used to help people with visual impairments take the right medicine at the time indicated in advance by the physician.

Relevance: 30.00%

Abstract:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevance: 30.00%

Abstract:

The purpose of this study was to determine the influence of hearing protection devices (HPDs) on the understanding of speech in young adults with normal hearing, both in silence and in the presence of ambient noise. The experimental research was carried out with the following variables: five different conditions of HPD use (without protectors, with each of two earplugs and with each of two earmuffs); one type of noise (pink noise); 4 test levels (60, 70, 80 and 90 dB[A]); 6 signal/noise ratios (without noise, +5, +10, 0, -5 and -10 dB); and 5 repetitions for each case, totalling 600 tests with 10 monosyllables in each one. The measured variable was the percentage of correctly heard monosyllabic words in the test. The results revealed that, at the lowest levels (60 and 70 dB), the protectors reduced the intelligibility of speech (compared to the tests without protectors), while in the presence of ambient noise levels of 80 and 90 dB and unfavourable signal/noise ratios (0, -5 and -10 dB), the HPDs improved intelligibility. A comparison of the effectiveness of earplugs versus earmuffs showed that the former are more effective for speech recognition, providing a 30% improvement over situations in which no protection is used. As might be expected, this study confirmed that the protectors' influence on speech intelligibility is related directly to the spectral curve of the protector's attenuation. (C) 2003 Elsevier B.V. Ltd. All rights reserved.
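The total of 600 tests follows from crossing the four experimental factors: 5 × 4 × 6 × 5 = 600. A short sketch that enumerates the design makes the arithmetic explicit; the condition labels are hypothetical placeholders, since the abstract does not name the individual protectors.

```python
from itertools import product

# Hypothetical labels for the five HPD conditions described in the abstract.
HPD_CONDITIONS = ["none", "earplug_1", "earplug_2", "earmuff_1", "earmuff_2"]
TEST_LEVELS_DBA = [60, 70, 80, 90]
# None stands for the "without noise" condition; the rest are S/N ratios in dB.
SNR_DB = [None, 5, 10, 0, -5, -10]
REPETITIONS = range(1, 6)
WORDS_PER_TEST = 10

# Every combination of condition, level, S/N ratio and repetition is one test.
tests = list(product(HPD_CONDITIONS, TEST_LEVELS_DBA, SNR_DB, REPETITIONS))


def percent_correct(n_correct, n_words=WORDS_PER_TEST):
    """Score of one test: percentage of monosyllables heard correctly."""
    if not 0 <= n_correct <= n_words:
        raise ValueError("n_correct must lie between 0 and n_words")
    return 100.0 * n_correct / n_words
```

Here `len(tests)` is 600, and with 10 monosyllables per test the full design presents 6000 words.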

Relevance: 30.00%

Abstract:

OBJECTIVE: The general objective was to detect amblyogenic factors in a population of preschool children using refractometric examinations and the PhotoScreenerTM (PS); the specific objective was to verify whether assessment with the PS is useful as a screening method in amblyopia prevention campaigns for children. METHODS: An observational, prospective study was carried out from January to December 2007, evaluating 227 preschool children in order to detect, by means of a questionnaire, refractometric examinations and photographs taken with the PS, the presence of factors that cause amblyopia in the study population. All children were evaluated with the PS. Next, all children underwent cycloplegia and were evaluated using a Shin Nippon® automatic refractor. Children found to have eye problems received an optical prescription according to the following criteria: hyperopia greater than +1.50 D, myopia greater than -1.00 D and astigmatism greater than 1.00 D. The data were analyzed using Goodman's agreement test, descriptive statistics and a study of the specificity and sensitivity of the PS, comparing its results with those of the other methods of ophthalmological evaluation. RESULTS: The distribution between the sexes was similar, and most of the children were four or five years old. The sensitivity (S) of the PS, comparing the result obtained with this device against the autorefractor under cycloplegia, was 50.9%. The specificity was 78.9%, the positive predictive value 70%, the negative predictive value 62.5% and the accuracy 65.1%. CONCLUSIONS: Of the 101 children whose PS photographs could be analyzed satisfactorily, thirty-six had a refractive error that required correction. Compared with the spherical equivalent from the autorefractor under cycloplegia, the PS is a reasonable screening method, although its sensitivity is not good. A positive point worth highlighting is the considerable specificity.
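The five screening figures reported above are all derived from one 2×2 table that compares the screener with the reference examination. The abstract does not report that table, so the sketch below only shows the standard definitions; the function name is hypothetical, and the counts in the usage note are invented for illustration and are not the study's data.

```python
def screening_metrics(tp, fp, fn, tn):
    """Standard screening metrics from a 2x2 table of counts.

    tp: screener positive, reference positive
    fp: screener positive, reference negative
    fn: screener negative, reference positive
    tn: screener negative, reference negative
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }
```

For example, `screening_metrics(tp=28, fp=12, fn=27, tn=45)` uses made-up counts chosen only so that the resulting values land close to the reported percentages. It shows how a low sensitivity (many missed cases, `fn`) can coexist with a fairly high specificity.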

Relevance: 30.00%

Abstract:

BACKGROUND: The objective of this study was to describe the clinical, behavioral, cognitive and communicative aspects of individuals with a genetic diagnosis of Smith-Magenis syndrome. PROCEDURES: Two male individuals, aged nine and 19 years, participated. Clinical and laboratory genetic evaluation was performed (FISH test, using a probe for the 17p11.2 region). The psychological evaluation consisted of behavioral observation and administration of the Wechsler Intelligence Scale. The speech-language evaluation was carried out by means of formal and informal procedures and peripheral hearing assessment. RESULTS: Clinical genetic analysis revealed the phenotypic characteristics of Smith-Magenis syndrome, confirmed by laboratory evaluation. The psychological evaluation revealed the peculiar behavioral phenotype of Smith-Magenis syndrome and confirmed moderate intellectual disability in both individuals. The speech-language evaluation showed alterations in linguistic performance, at the phonological, semantic, syntactic and pragmatic levels and in psycholinguistic abilities, interfering with communicative and learning abilities. The hearing assessment indicated peripheral hearing within normal parameters. CONCLUSION: The multidisciplinary evaluation supported the description of the clinical, behavioral and cognitive aspects that belong to the behavioral phenotype of Smith-Magenis syndrome and showed that these individuals have severe alterations of oral language, psycholinguistic abilities and the processing of visual and auditory information, with marked effects on the development of communicative abilities and learning processes.

Relevance: 30.00%

Abstract:

Hearing loss is one of the most common clinical findings in subjects with ear malformations. Treatment consists of surgery and/or fitting a bone-conduction hearing aid (AASI VO). Early intervention is essential to promote auditory stimulation and the development of speech and language. OBJECTIVE: To characterize the audiological profile of subjects with congenital malformation of the external and/or middle ear and to evaluate their benefit from and satisfaction with the use of the AASI VO. METHOD: Descriptive study of subjects with bilateral congenital malformations of the external and/or middle ear and moderate or severe conductive or mixed hearing loss who were AASI VO users. Benefit was evaluated using a sentence recognition test with competing noise and functional gain measurements, and satisfaction was evaluated using the international questionnaire QI - AASI. RESULTS: Thirteen subjects were evaluated, 61% of them male and 80% with moderate or severe conductive hearing loss. Performance in the proposed evaluation was better in the aided condition (with AASI) than in the unaided condition. CONCLUSION: The behind-the-ear AASI VO offered advantages for the population studied and should be considered as an option for intervention. Satisfaction was confirmed by the high scores obtained on the QI - AASI.

Relevance: 30.00%

Abstract:

This study investigated the influence of visual stimulus characteristics and the effect of intention on the postural control responses of older women to visual manipulation. The 20 participants stood upright in a moving room during seven trials lasting 1 minute each, looking at a fixed target, while their body sway was measured. In the first trial the room did not move at all, but from the second trial onward the room was moved in the anterior-posterior direction. For ten participants, the peak velocity of the movement was 0.6 cm/s and, for the others, 1.0 cm/s. From the fifth trial onward, the participants were informed of the room's movement and instructed to resist it. The results indicate that the body sway of the older women is induced by the movement of the moving room. Intention and a change in the visual stimulus characteristic reduce the influence of visual information on body sway, but manipulating a stimulus property (in this case, velocity) is less effective than intention. This greater dependence on intention to alter the influence of a sensory stimulus on postural control indicates that the postural control system in older adults does not allow automatic adjustments of postural responses to small variations in environmental conditions. Information about such variations can be directed so as to compensate for this difference.

Relevance: 30.00%

Abstract:

This article is part of a study that seeks to understand the main alternatives for including students with visual impairment in the context of physics teaching. Focusing on optics classes, it analyzes the communicational viability between pre-service teachers and students with visual impairment. To this end, it emphasizes the empirical and semantic-sensory structures of the languages used, indicating factors that generate accessibility to the information conveyed. It also recommends alternatives intended to enable the effective participation of students with visual impairment in the communicative process, notably: identifying the semantic-sensory structure of the meanings conveyed; knowing the student's visual history; using languages with an interdependent tactile-auditory empirical structure in interactive contexts; and exploring the communicational potential of languages made up of fundamental auditory empirical structures and of independent auditory and visual ones.