61 results for Visual Speaker Recognition, Visual Speech Recognition, Cascading Appearance-Based Features
in the Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Abstract:
In this paper we present a new wavelet-based algorithm for low-cost computation of the cepstrum. It can be used for real-time, precise pitch determination in automatic speech and speaker recognition systems. Many wavelet families are examined to determine the one that works best. The results confirm the efficacy and accuracy of the proposed technique for pitch extraction. (C) 2008 Elsevier B.V. All rights reserved.
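For orientation, cepstral pitch extraction can be sketched as follows. This is a minimal illustration that uses the conventional FFT-based real cepstrum, not the wavelet-based algorithm the paper proposes (the abstract does not describe it in enough detail to reproduce). The function name `cepstral_pitch`, the search band `fmin`/`fmax`, and the synthetic test frame are all assumptions made for the example.

```python
import numpy as np

def cepstral_pitch(frame, sample_rate, fmin=60.0, fmax=400.0):
    """Estimate the pitch (Hz) of one voiced frame from its real cepstrum.

    FFT-based stand-in for illustration; fmin and fmax are assumed
    bounds on the plausible pitch range.
    """
    windowed = frame * np.hamming(len(frame))
    spectrum = np.fft.rfft(windowed)
    # Real cepstrum: inverse transform of the log magnitude spectrum.
    log_magnitude = np.log(np.abs(spectrum) + 1e-12)
    cepstrum = np.fft.irfft(log_magnitude)
    # Restrict the peak search to quefrencies (in samples) inside the band.
    q_lo = int(sample_rate / fmax)
    q_hi = int(sample_rate / fmin)
    peak = q_lo + int(np.argmax(cepstrum[q_lo:q_hi]))
    return sample_rate / peak

# Hypothetical test signal: a harmonic-rich tone with a 200 Hz fundamental.
sample_rate = 16000
t = np.arange(1024) / sample_rate
frame = np.zeros_like(t)
for k in range(1, 40):  # harmonics up to 7800 Hz, below Nyquist
    frame += np.sin(2 * np.pi * 200.0 * k * t) / k

estimate = cepstral_pitch(frame, sample_rate)
print(f"estimated pitch: {estimate:.1f} Hz")
```

The peak of the cepstrum inside the search band marks the pitch period in samples, so dividing the sample rate by that index gives the pitch in hertz.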
Abstract:
The goal of the current study was to compare the quality of esophageal speech and voice to videofluoroscopic features of the esophagus and pharyngoesophageal (PE) segment. The speech and voice characteristics of 30 laryngectomized patients were rated by 5 speech-language pathologists. Based on these ratings, patients were divided into 3 categories: fluent (n = 9), moderately fluent (n = 10) and nonfluent (n = 11). Videofluoroscopy of the PE region was then performed during both swallowing and voice production. An insufflation test and percutaneous pharyngeal plexus block were required in 9 patients to determine the etiology of poor esophageal voice production. The strongest videofluoroscopic indicators of nonfluent speakers were: (1) small or absent air reservoir and (2) lack of a vibrating PE segment. Fluent speakers presented with shorter PE segments (1.17 mm) compared to moderately fluent speakers (17.1-29.9 mm). Perceptually, fluent speakers presented with a predominantly rough vocal quality. In contrast, moderately fluent speakers presented with a tense quality. In addition, stoma blast noise was reduced in fluent speakers. Videofluoroscopic findings highly correlated with the quality of esophageal speech. Copyright (C) 2009 S. Karger AG, Basel
Abstract:
Additional neurological features have recently been described in seven families transmitting pathogenic mutations in OPA1, the most common cause of autosomal dominant optic atrophy. However, the frequency of these syndromal 'dominant optic atrophy plus' variants and the extent of neurological involvement have not been established. In this large multi-centre study of 104 patients from 45 independent families, including 60 new cases, we show that extra-ocular neurological complications are common in OPA1 disease, and affect up to 20% of all mutational carriers. Bilateral sensorineural deafness beginning in late childhood and early adulthood was a prominent manifestation, followed by a combination of ataxia, myopathy, peripheral neuropathy and progressive external ophthalmoplegia from the third decade of life onwards. We also identified novel clinical presentations with spastic paraparesis mimicking hereditary spastic paraplegia, and a multiple sclerosis-like illness. In contrast to initial reports, multi-system neurological disease was associated with all mutational subtypes, although there was an increased risk with missense mutations (odds ratio = 3.06, 95% confidence interval = 1.44-6.49; P = 0.0027), and mutations located within the guanosine triphosphate-ase region (odds ratio = 2.29, 95% confidence interval = 1.08-4.82; P = 0.0271). Histochemical and molecular characterization of skeletal muscle biopsies revealed the presence of cytochrome c oxidase-deficient fibres and multiple mitochondrial DNA deletions in the majority of patients harbouring OPA1 mutations, even in those with isolated optic nerve involvement. However, the cytochrome c oxidase-deficient load was over four times higher in the dominant optic atrophy plus group compared to the pure optic neuropathy group, implicating a causal role for these secondary mitochondrial DNA defects in disease pathophysiology. Individuals with dominant optic atrophy plus phenotypes also had significantly worse visual outcomes, and careful surveillance is therefore mandatory to optimize the detection and management of neurological disability in a group of patients who already have significant visual impairment.
Abstract:
Motivated by a recently proposed biologically inspired face recognition approach, we investigated the relation between human behavior and a computational model based on Fourier-Bessel (FB) spatial patterns. We measured human recognition performance of FB filtered face images using an 8-alternative forced-choice method. Test stimuli were generated by converting the images from the spatial to the FB domain, filtering the resulting coefficients with a band-pass filter, and finally taking the inverse FB transformation of the filtered coefficients. The performance of the computational models was tested using a simulation of the psychophysical experiment. In the FB model, face images were first filtered by simulated V1-type neurons and later analyzed globally for their content of FB components. In general, there was a higher human contrast sensitivity to radially than to angularly filtered images, but both functions peaked at the 11.3-16 frequency interval. The FB-based model presented similar behavior with regard to peak position and relative sensitivity, but had a wider frequency bandwidth and a narrower response range. The response pattern of two alternative models, based on local FB analysis and on raw luminance, strongly diverged from the human behavior patterns. These results suggest that human performance can be constrained by the type of information conveyed by polar patterns, and consequently that humans might use FB-like spatial patterns in face processing.
Abstract:
Background: Patients with early age-related maculopathy (ARM) do not necessarily show obvious morphological signs or functional impairment. Many have good visual acuity, yet complain of decreased visual performance. The aim of this study was to investigate the aging effects on performance of parafoveal letter recognition at reduced contrast, and defects caused by early ARM and normal fellow eyes of patients with unilateral age-related macular degeneration (nfAMD). Methods: Testing of the central visual field (8° radius) was performed by the Macular Mapping Test (MMT) using recognition of letters in 40 parafoveal target locations at four contrast levels (5, 10, 25 and 100%). Effects of aging were investigated in 64 healthy subjects aged 23 to 76 years (CTRL). In addition, 39 eyes (minimum visual acuity of 0.63; 20/30) from 39 patients with either no visible signs of ARM, while the fellow eye had advanced age-related macular degeneration (nfAMD; n=12), or early signs of ARM (eARM; n=27) were examined. Performance was expressed summarily as a "field score" (FS). Results: Performance in the MMT begins to decline linearly with age in normal subjects from the age of 50 and 54 years on, at 5% and 10% contrast respectively. The differentiation between patients and CTRLs was enhanced if FS at 5% was analyzed along with FS at 10% contrast. In 8/12 patients from group nfAMD and in 18/27 from group eARM, the FS was statistically significantly lower than in the CTRL group in at least one of the lower contrast levels. Conclusion: Using parafoveal test locations, a recognition task and diminished contrast increases the chance of early detection of functional defects due to eARM or nfAMD and can differentiate them from those due to aging alone.
Dynamic Changes in the Mental Rotation Network Revealed by Pattern Recognition Analysis of fMRI Data
Abstract:
We investigated the temporal dynamics and changes in connectivity in the mental rotation network through the application of spatio-temporal support vector machines (SVMs). The spatio-temporal SVM [Mourao-Miranda, J., Friston, K. J., et al. (2007). Dynamic discrimination analysis: A spatial-temporal SVM. Neuroimage, 36, 88-99] is a pattern recognition approach that is suitable for investigating dynamic changes in the brain network during a complex mental task. It does not require a model describing each component of the task and the precise shape of the BOLD impulse response. By defining a time window including a cognitive event, one can use spatio-temporal fMRI observations from two cognitive states to train the SVM. During the training, the SVM finds the discriminating pattern between the two states and produces a discriminating weight vector encompassing both voxels and time (i.e., spatio-temporal maps). We showed that by applying spatio-temporal SVM to an event-related mental rotation experiment, it is possible to discriminate between different degrees of angular disparity (0 degrees vs. 20 degrees, 0 degrees vs. 60 degrees, and 0 degrees vs. 100 degrees), and the discrimination accuracy is correlated with the difference in angular disparity between the conditions. For the comparison with highest accuracy (0 degrees vs. 100 degrees), we evaluated how the most discriminating areas (visual regions, parietal regions, supplementary, and premotor areas) change their behavior over time. The frontal premotor regions became highly discriminating earlier than the superior parietal cortex. There seems to be a parcellation of the parietal regions with an earlier discrimination of the inferior parietal lobe in the mental rotation in relation to the superior parietal. The SVM also identified a network of regions that had a decrease in BOLD responses during the 100 degrees condition in relation to the 0 degrees condition (posterior cingulate, frontal, and superior temporal gyrus). This network was also highly discriminating between the two conditions. In addition, we investigated changes in functional connectivity between the most discriminating areas identified by the spatio-temporal SVM. We observed an increase in functional connectivity between almost all areas activated during the 100 degrees condition (bilateral inferior and superior parietal lobe, bilateral premotor area, and SMA) but not between the areas that showed a decrease in BOLD response during the 100 degrees condition.
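The core idea of a spatio-temporal classifier, as described in the abstract, is that each trial's time window (voxels by time points) is flattened into one feature vector, and the learned weight vector is reshaped back into a voxel-by-time map. The sketch below illustrates that idea on synthetic data only. The solver is a plain subgradient-descent linear SVM chosen to keep the example dependency-free; it is not claimed to be the solver used in the study, and every name, dimension, and the injected effect are hypothetical.

```python
import numpy as np

def train_linear_svm(X, y, reg=0.01, epochs=300, lr=0.1):
    """Fit a linear SVM (hinge loss + L2 penalty) by batch subgradient descent."""
    n_samples, n_features = X.shape
    w = np.zeros(n_features)
    b = 0.0
    for _ in range(epochs):
        margins = y * (X @ w + b)
        active = margins < 1.0  # trials that violate the margin
        grad_w = reg * w - (y[active, None] * X[active]).sum(axis=0) / n_samples
        grad_b = -y[active].sum() / n_samples
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# Synthetic "fMRI" trials: trials x voxels x time points (all hypothetical).
rng = np.random.default_rng(0)
n_trials, n_voxels, n_times = 80, 20, 6
labels = np.repeat([-1.0, 1.0], n_trials // 2)  # two cognitive states
data = rng.normal(size=(n_trials, n_voxels, n_times))
# Assumed effect: voxel 3 responds more strongly in state +1 at time points 2-3.
data[labels > 0, 3, 2:4] += 2.0

# Flatten each trial's window so space and time are learned jointly.
X = data.reshape(n_trials, n_voxels * n_times)
w, b = train_linear_svm(X, labels)

# Reshape the weights back into a spatio-temporal discrimination map.
weight_map = w.reshape(n_voxels, n_times)
train_accuracy = float(np.mean(np.sign(X @ w + b) == labels))
peak_voxel, peak_time = np.unravel_index(np.argmax(np.abs(weight_map)), weight_map.shape)
print(train_accuracy, peak_voxel, peak_time)
```

Each row of `weight_map` is a voxel and each column a time point, so reading along a row shows when that voxel becomes discriminating. The accuracy printed here is measured on the training trials and is therefore optimistic; a real analysis would estimate it on held-out trials.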
Abstract:
Fragile X syndrome is the most frequent cause of inherited intellectual disability. The Dandy-Walker variant is a specific constellation of neuroradiological findings. This study reports the oral and written communication findings of a 15-year-old boy with a clinical and molecular diagnosis of fragile X syndrome and brain neuroimaging findings compatible with the Dandy-Walker variant. The speech-language evaluation consisted of the Observation of Communicative Behavior, the ABFW - Teste de Linguagem Infantil - Fonologia (child language test, phonology), the Phonological Abilities Profile, the Teste de Desempenho Escolar (school performance test), the Illinois Test of Psycholinguistic Abilities, an evaluation of the stomatognathic system, and an audiological evaluation. The findings were: oral language impairment in phonological, semantic, pragmatic, and morphosyntactic abilities; deficits in psycholinguistic abilities (auditory reception, verbal expression, sound blending, auditory and visual sequential memory, auditory closure, auditory and visual association); and morphological and functional alterations of the stomatognathic system. In reading, there were difficulties in decoding graphic symbols; in writing, there were omissions, agglutinations, and multiple representations, with predominant use of vowels and difficulties in visuospatial organization. In mathematics, despite recognizing numbers, he did not perform arithmetic operations. No alterations were observed in the peripheral audiological evaluation. The constellation of behavioral, cognitive, linguistic, and perceptual symptoms expected in fragile X syndrome, together with the structural alterations of the central nervous system that belong to the Dandy-Walker variant, markedly interfered with the development of communicative abilities, with learning to read and write, and with the individual's social integration.
Abstract:
The reduced availability of native timber species and its effects on the economy, together with the strengthening of environmental preservation concepts, created the need to develop viable alternatives for the rational use of reforestation species. One option is visual grading of the pieces. Authors working in this line of research verified that the visual grading rules of the Southern Pine Inspection Bureau (SPIB) of the USA are suitable for Brazilian Pinus timber and presented a proposal to standardize the visual grading of this timber. In this grading, the aspects with the greatest influence are: presence of knots, slope of grain relative to the axis of the piece, and growth-ring density. Accordingly, this research presents an experimental study, which consisted of the visual grading and tensile strength determination of 85 pieces of Pinus spp, and a theoretical study, which proposed an equation for determining the mean tensile strength of structural pieces as a function of visual grade. This work made it possible to observe the influence of knots and growth rings on the tensile strength of the pieces analyzed.
Abstract:
Pain assessment in animals requires rating scales, which depend on interpretation by observers. The aim of the present study was to evaluate the correlation between the visual analog scale (VAS), the Melbourne scale, and Von Frey filaments in the assessment of postoperative pain in 42 healthy adult bitches undergoing ovariosalpingohysterectomy (OSH). Postoperative pain was assessed by two observers blinded to the analgesic treatments, at one-hour intervals, using the VAS, the Melbourne scale, and Von Frey filaments applied around the surgical incision. The criteria for rescue analgesia were a score of 50 mm on the VAS or 13 points on the Melbourne scale. The VAS proved to be the most sensitive scale, since 100% of the animals received rescue analgesia under this method. The values obtained with the VAS and the Melbourne scale showed good correlation (r = 0.74), which was not the case with the Von Frey filaments (r = -0.18). The correlation between the Melbourne scale and the Von Frey filaments was -0.37. Although the VAS and the Melbourne scale correlated well, it is suggested that a lower Melbourne scale score be considered as the criterion for administering rescue analgesia.
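As a reminder of how a correlation coefficient between two pain scales is computed, here is a minimal sketch. It assumes the reported r is Pearson's coefficient, which the abstract does not state, and the paired scores below are invented purely for illustration; they are not data from the study.

```python
import numpy as np

def pearson_r(x, y):
    """Pearson correlation coefficient between two paired score lists."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    dx = x - x.mean()
    dy = y - y.mean()
    return float((dx * dy).sum() / np.sqrt((dx ** 2).sum() * (dy ** 2).sum()))

# Hypothetical paired scores for six animals (NOT study data).
vas_mm = [12, 35, 50, 64, 28, 71]            # visual analog scale, in mm
melbourne_points = [3, 8, 11, 15, 9, 14]     # Melbourne scale, in points

r = pearson_r(vas_mm, melbourne_points)
print(f"r = {r:.2f}")
```

A value near 1 means the two scales rank the animals similarly; a value near 0 or negative, as reported for the Von Frey filaments, means they do not.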
Abstract:
A modified version of the intruder-resident paradigm was used to investigate if social recognition memory lasts at least 24 h. One hundred and forty-six adult male Wistar rats were used. Independent groups of rats were exposed to an intruder for 0.083, 0.5, 2, 24, or 168 h and tested 24 h after the first encounter with the familiar or a different conspecific. Factor analysis was employed to identify associations between behaviors and treatments. Resident rats exhibited a 24-h social recognition memory, as indicated by a 3- to 5-fold decrease in social behaviors in the second encounter with the same conspecific compared to those observed for a different conspecific, when the duration of the first encounter was 2 h or longer. It was possible to distinguish between two different categories of social behaviors, and their expression depended on the duration of the first encounter. Sniffing the anogenital area (49.9% of the social behaviors), sniffing the body (17.9%), sniffing the head (3%), and following the conspecific (3.1%), exhibited mostly by resident rats, characterized social investigation and revealed long-term social recognition memory. However, dominance (23.8%) and mild aggression (2.3%), exhibited by both residents and intruders, characterized social agonistic behaviors and were not affected by memory. In contrast, sniffing the environment (76.8% of the non-social behaviors) and rearing (14.3%), both exhibited mostly by adult intruder rats, characterized non-social behaviors. Together, these results show that social recognition memory in rats may last at least 24 h after a 2-h or longer exposure to the conspecific.
Abstract:
PURPOSE: To report a new, direct visual approach for rat pinealectomy. METHODS: Eighty adult female rats (Rattus norvegicus albinus EPM-1 strain) were weighed and anesthetized intraperitoneally with 15 mg/kg xylazine and 30 mg/kg ketamine. The animal was fastened to a dissection table, and an incision was made in the skin and the subcutaneous tissue, bringing the lambda into view. The skullcap was opened with a dental drill, bringing the cerebral hemispheres and the superior sagittal sinus into view. The pineal gland, located under the venous sinus, was removed in a single piece using tweezers. Next, the bone fragment was returned to its place and the surgical layers were sutured. RESULTS: This new technique is easy to perform, avoids bleeding, and removes only the pineal gland without damage to the remaining encephalon. In addition, it makes sham surgery possible, allowing the pineal gland to remain intact. CONCLUSION: The proposed technique is intended to facilitate studies aiming to better understand the complexity and importance of the pineal gland in reproductive and other body systems.
Abstract:
Testing contexts have been shown to critically influence experimental results in psychophysical studies. One such context, which markedly modulates the behavioral effects of different stimulus conditions, is whether these conditions are presented separately (blocked) or mixed. The study presents evidence that the apparent discriminabilities of two target stimuli can change according to which of these two testing contexts is used. A cross inside a ring and a vertical line inside a ring were presented as go stimuli in a go/no-go reaction time task. In one experiment, each of these stimuli was presented to a different group of volunteers, and in another experiment they were presented to the same group of volunteers, randomly mixed in the blocks of trials. Similar reaction times were obtained for the two stimuli in the first experiment, and different reaction times (faster for the cross) in the second experiment. The latter result indicates that the two stimuli differ in their discriminability from the no-go stimulus, with the cross being more discriminable. This difference is, however, masked in a separate testing context, presumably by the adoption of specific compensatory attentional sets.
Abstract:
OBJECTIVE: To develop a method and a device to quantify vision in candela (cd). Studies of vision measurement are important for all visual sciences. METHODS: This is a theoretical and experimental study. The details of the psychophysical method and of the device calibration were described. Preliminary tests were carried out in volunteers. RESULTS: It is a simple psychophysical test whose result is expressed in units of the International System of Units. The technical description will make it possible to reproduce the experiment in other research centers. CONCLUSION: Results measured in luminous intensity (cd) are an option for visual studies. These results will make it possible to extrapolate measurements to mathematical models and to simulate individual effects with aberrometric data.
Abstract:
We measured the effects of epilepsy on visual contrast sensitivity to linear and vertical sine-wave gratings. Sixteen female adults, aged 21 to 50 years, comprised the sample in this study, including eight adults with generalized tonic-clonic seizure-type epilepsy and eight age-matched controls without epilepsy. Contrast threshold was measured using a temporal two-alternative forced-choice binocular psychophysical method at a distance of 150 cm from the stimuli, with a mean luminance of 40.1 cd/m². A one-way analysis of variance (ANOVA) applied to the linear contrast threshold showed significant differences between groups (F[3,188] = 14.829; p < .05). Adults with epilepsy had higher contrast thresholds (1.45, 1.04, and 1.18 times for frequencies of 0.25, 2.0, and 8.0 cycles per degree of visual angle, respectively). The Tukey Honestly Significant Difference post hoc test showed significant differences (p < .05) for all of the tested spatial frequencies. The largest difference between groups was in the lowest spatial frequency. Therefore, epilepsy may cause more damage to the neural pathways that process low spatial frequencies. However, epilepsy probably alters both the magnocellular visual pathway, which processes low spatial frequencies, and the parvocellular visual pathway, which processes high spatial frequencies. The experimental group had lower visual contrast sensitivity to all tested spatial frequencies.
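Because contrast sensitivity is conventionally defined as the reciprocal of the contrast threshold, the threshold ratios reported above translate directly into relative sensitivities. The short sketch below does that arithmetic with the three ratios from the abstract; the variable names are illustrative only.

```python
# Threshold ratios (epilepsy group / control group) reported in the abstract,
# keyed by spatial frequency in cycles per degree of visual angle.
threshold_ratio = {0.25: 1.45, 2.0: 1.04, 8.0: 1.18}

# Sensitivity = 1 / threshold, so the sensitivity ratio is the reciprocal.
relative_sensitivity = {freq: 1.0 / ratio for freq, ratio in threshold_ratio.items()}

for freq, value in relative_sensitivity.items():
    print(f"{freq} cpd: sensitivity is {value:.2f} of the control value")
# 0.25 cpd: sensitivity is 0.69 of the control value
# 2.0 cpd: sensitivity is 0.96 of the control value
# 8.0 cpd: sensitivity is 0.85 of the control value

# Spatial frequency with the largest sensitivity loss.
most_affected = min(relative_sensitivity, key=relative_sensitivity.get)
```

The lowest spatial frequency (0.25 cycles per degree) shows the largest loss, in line with the abstract's statement that the groups differed most at the lowest frequency.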
Abstract:
Promoting the school inclusion of people with visual impairment requires professionals to know how these students perceive their own limitations and possibilities. This study identified characteristics and perceptions of schoolchildren with visual impairment regarding their rehabilitation process. A descriptive cross-sectional study was conducted with students aged 12 years and older enrolled in the public school system of a municipality in the State of São Paulo. A questionnaire was administered by interview. The population comprised 26 students, 46.2% with low vision and 53.8% with blindness, with a mean age of 17.1 years. Grade repetition was reported by 73.1%. Among the school difficulties arising from blindness, reading textbooks stood out; among those arising from low vision, seeing the blackboard. The level of schooling was low relative to the mean age. The students' perceptions of the issue of school inclusion were coherent.