73 results for audio visual speech recognition
in Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Abstract:
Audiometry is the main way in which hearing is evaluated, because it is a universal and standardized test. Speech tests are difficult to standardize due to the variables involved, yet their performance in the presence of competitive noise is of great importance. Aim: To characterize speech intelligibility in silence and in competitive noise in individuals exposed to electronically amplified music. Material and Method: The study was performed with 20 university students who presented normal hearing thresholds. The speech recognition rate (SRR) was measured after fourteen hours of sound rest after the exposure to electronically amplified music and once again after sound rest, and was studied in three stages: without competitive noise, and in the presence of babble-type competitive noise in monotic listening at a signal/noise ratio of +5 dB and at a signal/noise ratio of 5 dB. Results: There was greater damage in the SRR after exposure to the music and with competitive noise, and as the signal/noise ratio decreased, the performance of individuals in the test also decreased. Conclusion: The inclusion of competitive noise in speech tests in the audiological routine is important, because it represents the real disadvantage experienced by individuals in daily listening.
Abstract:
The aim of this study was to compare the learning process of a highly complex ballet skill following demonstrations of point-light and video models. Sixteen participants, divided into point-light and video groups (ns = 8), performed 160 trials of a pirouette, equally distributed in blocks of 20 trials, alternating periods of demonstration and practice, with a retention test a day later. Measures of head and trunk oscillation, coordination disparity from the model, and movement time difference showed similarities between the video and point-light groups; ballet experts' evaluations indicated superiority of performance in the video group over the point-light group. Results are discussed in terms of the task requirements of dissociation between head and trunk rotations, focusing on the hypothesis of sufficiency and higher relevance of information contained in biological motion models applied to the learning of complex motor skills.
Abstract:
Speech understanding disorders in the elderly may be due to peripheral or central auditory dysfunctions. Asymmetry of results in dichotic testing increases with age, and may reflect a lack of inter-hemispheric transmission and cognitive decline. Aim: To investigate auditory processing in aged people with no hearing complaints. Study design: clinical prospective. Materials and Methods: Twenty-two volunteers, aged between 55 and 75 years, were evaluated. They reported no hearing complaints and had maximum auditory thresholds of 40 dB HL up to 4 kHz, minimum speech recognition scores of 80%, and peripheral symmetry between the ears. We used two kinds of tests: speech in noise and dichotic alternated dissyllables (SSW). Results were compared between males and females, between right and left ears, and between age groups. Results: There were no significant differences between genders in either test. Left ears showed worse results in the competitive condition of the SSW. Individuals aged 65 or older performed worse than those aged 55 to 64. Conclusion: Central auditory tests showed worse performance with aging. The use of a dichotic test in the auditory evaluation of the elderly may help in the early identification of degenerative processes, which are common among these patients.
Abstract:
Dynamic Time Warping (DTW), a pattern-matching technique traditionally used for restricted-vocabulary speech recognition, is based on a temporal alignment of the input signal with the template models. The principal drawback of DTW is its high computational cost as the lengths of the signals increase. This paper extends the results of our previously published conference paper, which introduced an optimized version of DTW that is based on the Discrete Wavelet Transform (DWT). (C) 2008 Elsevier B.V. All rights reserved.
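To make the cost argument concrete, here is a minimal Python sketch of classical DTW between two one-dimensional sequences; its nested loops show why the cost grows with the product of the two signal lengths. The abstract does not describe how the authors combine DTW with the DWT, so the `haar_approximation` helper is only an illustrative assumption: it halves each signal by pairwise averaging (an unnormalized Haar approximation) before alignment, which is one common way to reduce the cost and not necessarily the paper's method. Both function names are hypothetical.

```python
def dtw_distance(a, b):
    """Classical DTW cost between two numeric sequences (absolute difference as local cost)."""
    n, m = len(a), len(b)
    inf = float("inf")
    # prev[j] holds the best cumulative cost of aligning a[:i-1] with b[:j].
    prev = [inf] * (m + 1)
    prev[0] = 0.0
    for i in range(1, n + 1):
        curr = [inf] * (m + 1)
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # Allowed moves: advance in a only, in b only, or in both.
            curr[j] = cost + min(prev[j], curr[j - 1], prev[j - 1])
        prev = curr
    return prev[m]


def haar_approximation(x):
    """Assumed reduction step: average neighbouring pairs, halving the length.

    An odd-length input is padded by repeating its last sample.
    """
    if len(x) % 2 == 1:
        x = list(x) + [x[-1]]
    return [(x[k] + x[k + 1]) / 2.0 for k in range(0, len(x), 2)]


# Hypothetical template and input: the same shape, shifted in time.
template = [0.0, 1.0, 2.0, 3.0, 2.0, 1.0, 0.0, 0.0]
signal = [0.0, 0.0, 1.0, 2.0, 3.0, 2.0, 1.0, 0.0]

full = dtw_distance(template, signal)
coarse = dtw_distance(haar_approximation(template), haar_approximation(signal))
print("full-resolution DTW cost:", full)
print("half-resolution DTW cost:", coarse)
```

Each halving cuts the number of cells in the DTW table by roughly a factor of four, at the price of a coarser alignment.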
Abstract:
Fragile X syndrome is the most frequent cause of inherited intellectual disability. The Dandy-Walker variant is a specific constellation of neuroradiological findings. This study reports oral and written communication findings in a 15-year-old boy with a clinical and molecular diagnosis of Fragile X syndrome and brain neuroimaging findings compatible with the Dandy-Walker variant. The speech-language pathology evaluation consisted of the Observation of Communicative Behavior, the ABFW - Teste de Linguagem Infantil - Fonologia (child language test, phonology), the Phonological Abilities Profile (Perfil de Habilidades Fonológicas), the School Performance Test (Teste de Desempenho Escolar), the Illinois Test of Psycholinguistic Abilities, an evaluation of the stomatognathic system, and an audiological evaluation. The following were observed: oral language impairment in phonological, semantic, pragmatic, and morphosyntactic abilities; deficits in psycholinguistic abilities (auditory reception, verbal expression, sound blending, auditory and visual sequential memory, auditory closure, auditory and visual association); and morphological and functional alterations of the stomatognathic system. In reading, difficulties were found in decoding graphic symbols; in writing, there were omissions, agglutinations, and multiple representations, with predominant use of vowels and difficulties in visuospatial organization. In mathematics, despite recognizing numbers, he did not perform arithmetic operations. No alterations were observed in the peripheral audiological evaluation. The constellation of behavioral, cognitive, linguistic, and perceptual symptoms expected in Fragile X syndrome, added to the structural alterations of the central nervous system belonging to the Dandy-Walker variant, markedly interfered with the development of communicative abilities, the learning of reading and writing, and the individual's social integration.
Abstract:
Motivated by a recently proposed biologically inspired face recognition approach, we investigated the relation between human behavior and a computational model based on Fourier-Bessel (FB) spatial patterns. We measured human recognition performance on FB-filtered face images using an 8-alternative forced-choice method. Test stimuli were generated by converting the images from the spatial to the FB domain, filtering the resulting coefficients with a band-pass filter, and finally taking the inverse FB transformation of the filtered coefficients. The performance of the computational models was tested using a simulation of the psychophysical experiment. In the FB model, face images were first filtered by simulated V1-type neurons and later analyzed globally for their content of FB components. In general, human contrast sensitivity was higher for radially than for angularly filtered images, but both functions peaked at the 11.3-16 frequency interval. The FB-based model behaved similarly with regard to peak position and relative sensitivity, but had a wider frequency bandwidth and a narrower response range. The response patterns of two alternative models, based on local FB analysis and on raw luminance, strongly diverged from the human behavior patterns. These results suggest that human performance can be constrained by the type of information conveyed by polar patterns, and consequently that humans might use FB-like spatial patterns in face processing.
Abstract:
Profound hearing loss is a disability that affects personality, and when it affects teenagers before language acquisition, the resulting bio-psychosocial conflicts can be exacerbated, requiring careful evaluation and selection of candidates for cochlear implantation. Aim: To evaluate speech perception in adolescents with profound hearing loss who use cochlear implants. Study Design: Prospective. Materials and Methods: Twenty-five individuals with severe or profound pre-lingual hearing loss who underwent cochlear implantation during adolescence, between 10 years and 17 years and 11 months of age, took speech perception tests before the implant and 2 years after device activation. For comparison and analysis we used the results from the four-choice test, the vowel recognition test, and the sentence recognition tests in closed and open settings. Results: The average percentage of correct answers in the four-choice test was 46.9% before the implant and rose to 86.1% after 24 months of device use; in the vowel recognition test, the average went from 45.13% to 83.13%; and in the sentence recognition tests in closed and open settings, it went from 19.3% to 60.6% and from 1.08% to 20.47%, respectively. Conclusion: All patients, although with mixed results, achieved statistically significant improvement in all speech tests that were employed.
Abstract:
Background: Patients with early age-related maculopathy (ARM) do not necessarily show obvious morphological signs or functional impairment. Many have good visual acuity, yet complain of decreased visual performance. The aim of this study was to investigate the effects of aging on parafoveal letter recognition at reduced contrast, and the defects found in eyes with early ARM and in normal fellow eyes of patients with unilateral age-related macular degeneration (nfAMD). Methods: Testing of the central visual field (8° radius) was performed with the Macular Mapping Test (MMT), using recognition of letters in 40 parafoveal target locations at four contrast levels (5, 10, 25 and 100%). Effects of aging were investigated in 64 healthy subjects aged 23 to 76 years (CTRL). In addition, 39 eyes (minimum visual acuity of 0.63; 20/30) from 39 patients with either no visible signs of ARM, while the fellow eye had advanced age-related macular degeneration (nfAMD; n=12), or early signs of ARM (eARM; n=27) were examined. Performance was expressed summarily as a "field score" (FS). Results: Performance in the MMT begins to decline linearly with age in normal subjects from the age of 50 and 54 years on, at 5% and 10% contrast respectively. The differentiation between patients and CTRLs was enhanced if FS at 5% was analyzed along with FS at 10% contrast. In 8/12 patients from group nfAMD and in 18/27 from group eARM, the FS was statistically significantly lower than in the CTRL group in at least one of the lower contrast levels. Conclusion: Using parafoveal test locations, a recognition task, and diminished contrast increases the chance of early detection of functional defects due to eARM or nfAMD and can differentiate them from those due to aging alone.
Dynamic Changes in the Mental Rotation Network Revealed by Pattern Recognition Analysis of fMRI Data
Abstract:
We investigated the temporal dynamics and changes in connectivity in the mental rotation network through the application of spatio-temporal support vector machines (SVMs). The spatio-temporal SVM [Mourao-Miranda, J., Friston, K. J., et al. (2007). Dynamic discrimination analysis: A spatial-temporal SVM. Neuroimage, 36, 88-99] is a pattern recognition approach that is suitable for investigating dynamic changes in the brain network during a complex mental task. It does not require a model describing each component of the task and the precise shape of the BOLD impulse response. By defining a time window including a cognitive event, one can use spatio-temporal fMRI observations from two cognitive states to train the SVM. During the training, the SVM finds the discriminating pattern between the two states and produces a discriminating weight vector encompassing both voxels and time (i.e., spatio-temporal maps). We showed that by applying spatio-temporal SVM to an event-related mental rotation experiment, it is possible to discriminate between different degrees of angular disparity (0 degrees vs. 20 degrees, 0 degrees vs. 60 degrees, and 0 degrees vs. 100 degrees), and the discrimination accuracy is correlated with the difference in angular disparity between the conditions. For the comparison with highest accuracy (0 degrees vs. 100 degrees), we evaluated how the most discriminating areas (visual regions, parietal regions, supplementary, and premotor areas) change their behavior over time. The frontal premotor regions became highly discriminating earlier than the superior parietal cortex. There seems to be a parcellation of the parietal regions, with the inferior parietal lobe becoming discriminating earlier in the mental rotation than the superior parietal lobe. The SVM also identified a network of regions that had a decrease in BOLD responses during the 100 degrees condition in relation to the 0 degrees condition (posterior cingulate, frontal, and superior temporal gyrus). This network was also highly discriminating between the two conditions. In addition, we investigated changes in functional connectivity between the most discriminating areas identified by the spatio-temporal SVM. We observed an increase in functional connectivity between almost all areas activated during the 100 degrees condition (bilateral inferior and superior parietal lobe, bilateral premotor area, and SMA) but not between the areas that showed a decrease in BOLD response during the 100 degrees condition.
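The idea of a weight vector spanning both voxels and time can be illustrated with a small, self-contained Python sketch. Everything in it is an assumption made for illustration: the data are synthetic (two time points, three "voxels"), the helper names are hypothetical, and the linear SVM is trained with a simple Pegasos-style subgradient method without a bias term, which is not necessarily the solver or setup used in the cited work.

```python
import random


def make_windows(n_per_class, n_time, n_voxels, seed=1):
    """Synthetic time windows for two states; only voxel 1 at time point 1 differs."""
    rng = random.Random(seed)
    windows, labels = [], []
    for label in (+1, -1):
        for _ in range(n_per_class):
            window = [[rng.gauss(0.0, 0.1) for _ in range(n_voxels)]
                      for _ in range(n_time)]
            window[1][1] += label  # the state-dependent signal
            windows.append(window)
            labels.append(label)
    return windows, labels


def flatten_window(window):
    """Concatenate a (time x voxel) window into one spatio-temporal feature vector."""
    return [value for scan in window for value in scan]


def train_linear_svm(X, y, lam=0.01, epochs=200, seed=0):
    """Pegasos-style subgradient training of a linear SVM (hinge loss, no bias)."""
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    order = list(range(len(X)))
    t = 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)
            margin = y[i] * sum(wj * xj for wj, xj in zip(w, X[i]))
            w = [(1.0 - eta * lam) * wj for wj in w]  # regularization shrinkage
            if margin < 1.0:  # sample violates the margin: move toward it
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
    return w


def predict(w, x):
    """Classify a flattened window by the sign of its projection onto w."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) >= 0 else -1


windows, labels = make_windows(n_per_class=10, n_time=2, n_voxels=3)
X = [flatten_window(win) for win in windows]
weights = train_linear_svm(X, labels)

# Reshape the weight vector back into a (time x voxel) map for inspection.
weight_map = [weights[t * 3:(t + 1) * 3] for t in range(2)]
for t, row in enumerate(weight_map):
    print("time", t, [round(v, 3) for v in row])
```

Reading the weights back as a time-by-voxel map is what allows statements about when, and not only where, a region becomes discriminating.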
Abstract:
The reduced availability of native wood species and its effects on the economy, together with the strengthening of environmental preservation concepts, created the need to develop viable alternatives for the rational use of reforestation species. One option is the visual grading of the pieces. Authors of studies in this line of research verified the suitability of the visual grading rules of the Southern Pine Inspection Bureau (SPIB) of the USA for Brazilian Pinus wood and presented a proposal to standardize the visual grading process for this wood. In this grading, the aspects with the greatest influence are the presence of knots, grain deviation relative to the axis of the piece, and growth ring density. Accordingly, this research presents an experimental study, consisting of the visual grading and determination of the tensile strength of 85 pieces of Pinus spp., and a theoretical study, which proposed an equation to determine the mean tensile strength of structural pieces as a function of visual grade. With this work, it was possible to observe the influence of knots and growth rings on the tensile strength of the pieces analyzed.
Abstract:
Pain assessment in animals requires the use of rating scales, which depend on interpretation by observers. The objective of the present study was to evaluate the correlation between the visual analogue scale (VAS), the Melbourne scale, and Von Frey filaments in the assessment of postoperative pain in 42 healthy adult female dogs undergoing ovariosalpingohysterectomy (OSH). Postoperative pain was assessed by two observers blinded to the analgesic treatments, at one-hour intervals, using the VAS, the Melbourne scale, and Von Frey filaments applied around the surgical incision. The criteria for rescue analgesia were a score of 50 mm on the VAS or 13 points on the Melbourne scale. The VAS proved to be the most sensitive scale, since 100% of the animals received rescue analgesia under this method. The values obtained with the VAS and the Melbourne scale showed good correlation (r=0.74), which did not occur with the Von Frey filaments (r=-0.18). The correlation between the Melbourne scale and the Von Frey filaments was -0.37. Although the VAS and the Melbourne scale showed good correlation, it is suggested that a lower score on the Melbourne scale be considered as the criterion for administering rescue analgesia.
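For readers who want to see how a correlation coefficient such as the reported r values is obtained, the following Python sketch computes Pearson's r for paired scores. The abstract does not state which coefficient the authors used, so Pearson's r is an assumption, and the visual analogue and Melbourne scores in the example are invented purely for illustration; they are not data from the study.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant data")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical paired scores for six animals (NOT data from the study).
vas_mm = [10, 25, 40, 55, 70, 30]          # visual analogue scale, in mm
melbourne_points = [3, 6, 9, 14, 15, 8]    # Melbourne scale, in points

r = pearson_r(vas_mm, melbourne_points)
print("r between the two scales:", round(r, 2))
```

A value near +1 or -1 indicates a strong linear relationship between two scales, while a value near 0 indicates little linear agreement.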
Abstract:
PURPOSE: To survey the number of procedures related to hearing aid (aparelho de amplificação sonora individual, AASI) fitting included in the Brazilian Unified Health System table (Tabela SUS). METHODS: Data on the procedures related to hearing aid fitting included in the Tabela SUS were gathered from the website www.datasus.gov.br. After the data were gathered, the outpatient care output recorded by Brazil's hearing health services from November 2004 to July 2010 was organized and analyzed descriptively. The data were analyzed statistically. RESULTS: Regarding procedures related to the dispensing of hearing aids nationwide within hearing health care, in 2006 speech-language therapy exceeded the number of hearing aid fittings, whereas speech-language follow-up was seldom carried out in the country. Hearing aids with technologies B and C have been fitted more often than those with technology A, and probe-microphone or 2cc-coupler measurements are seldom performed in hearing aid fitting compared with functional gain. CONCLUSION: There have been great advances in care for the hearing impaired in the country, but it is necessary to improve the follow-up of hearing aid users and to review procedures such as probe-microphone measurements and hearing aid technologies.
Abstract:
A modified version of the intruder-resident paradigm was used to investigate whether social recognition memory lasts at least 24 h. One hundred and forty-six adult male Wistar rats were used. Independent groups of rats were exposed to an intruder for 0.083, 0.5, 2, 24, or 168 h and tested 24 h after the first encounter with the familiar or a different conspecific. Factor analysis was employed to identify associations between behaviors and treatments. Resident rats exhibited a 24-h social recognition memory, as indicated by a 3- to 5-fold decrease in social behaviors in the second encounter with the same conspecific compared to those observed for a different conspecific, when the duration of the first encounter was 2 h or longer. It was possible to distinguish between two different categories of social behaviors, and their expression depended on the duration of the first encounter. Sniffing the anogenital area (49.9% of the social behaviors), sniffing the body (17.9%), sniffing the head (3%), and following the conspecific (3.1%), exhibited mostly by resident rats, characterized social investigation and revealed long-term social recognition memory. However, dominance (23.8%) and mild aggression (2.3%), exhibited by both residents and intruders, characterized social agonistic behaviors and were not affected by memory. In contrast, sniffing the environment (76.8% of the non-social behaviors) and rearing (14.3%), both exhibited mostly by adult intruder rats, characterized non-social behaviors. Together, these results show that social recognition memory in rats may last at least 24 h after a 2-h or longer exposure to the conspecific.
Abstract:
BACKGROUND: audiological evaluation of parents of individuals with hearing loss of autosomal recessive inheritance. AIM: to study the audiological profile of parents of individuals with hearing loss of autosomal recessive inheritance, inferred from family history or from molecular tests that detected a mutation in the GJB2 gene, which encodes Connexin 26. METHOD: 36 individuals between 30 and 60 years of age were evaluated and divided into two groups: a control group, with no hearing complaints and no family history of hearing impairment, and a study group composed of parents heterozygous for nonspecific autosomal recessive deafness genes or heterozygous carriers of a mutation in the Connexin 26 gene. All underwent pure-tone audiometry (0.25 to 8 kHz), high-frequency audiometry (9 to 20 kHz), and distortion product otoacoustic emissions (DPOAE). RESULTS: there were significant differences between the groups in DPOAE amplitude at the frequencies 1001 and 1501 Hz, with greater amplitude in the control group. There was no significant difference between the groups for pure-tone thresholds from 0.25 to 20 kHz. CONCLUSION: DPOAE were more effective than pure-tone audiometry in detecting auditory differences between the groups. More research is needed to verify the reliability of these data.
Abstract:
PURPOSE: To report a new, direct visual approach for rat pinealectomy. METHODS: Eighty adult female rats (Rattus norvegicus albinus, EPM-1 strain) were weighed and anesthetized intraperitoneally with 15 mg/kg xylazine and 30 mg/kg ketamine. The animal was fastened to a dissection table, and an incision was made in the skin and the subcutaneous tissue, bringing the lambda into view. The skullcap was opened with a dental drill, bringing the cerebral hemispheres and the superior sagittal sinus into view. The pineal gland, located under the venous sinus, was removed in a single piece using tweezers. Next, the bone fragment was returned to its place and the surgical layers were sutured. RESULTS: This new technique is easy to perform, avoids bleeding, and removes only the pineal gland without damage to the remaining encephalon. In addition, it makes sham surgery possible, allowing the pineal gland to remain intact. CONCLUSION: The proposed technique is intended to facilitate studies aiming to better understand the complexity and importance of the pineal gland in the reproductive and other body systems.