136 resultados para audio features
Resumo:
This paper introduces a novel interface designed to help blind and visually impaired people to explore and navigate on the Web. In contrast to traditionally used assistive tools, such as screen readers and magnifiers, the new interface employs a combination of both audio and haptic features to provide spatial and navigational information to users. The haptic features are presented via a low-cost force feedback mouse allowing blind people to interact with the Web, in a similar fashion to their sighted counterparts. The audio provides navigational and textual information through the use of non-speech sounds and synthesised speech. Interacting with the multimodal interface offers a novel experience to target users, especially to those with total blindness. A series of experiments have been conducted to ascertain the usability of the interface and compare its performance to that of a traditional screen reader. Results have shown the advantages that the new multimodal interface offers blind and visually impaired people. This includes the enhanced perception of the spatial layout of Web pages, and navigation towards elements on a page. Certain issues regarding the design of the haptic and audio features raised in the evaluation are discussed and presented in terms of recommendations for future work.
Resumo:
The Audio/Visual Emotion Challenge and Workshop (AVEC 2011) is the first competition event aimed at comparison of multimedia processing and machine learning methods for automatic audio, visual and audiovisual emotion analysis, with all participants competing under strictly the same conditions. This paper first describes the challenge participation conditions. Next follows the data used – the SEMAINE corpus – and its partitioning into train, development, and test partitions for the challenge with labelling in four dimensions, namely activity, expectation, power, and valence. Further, audio and video baseline features are introduced as well as baseline results that use these features for the three sub-challenges of audio, video, and audiovisual emotion recognition.
Resumo:
For the first time in this paper we present results showing the effect of speaker head pose angle on automatic lip-reading performance over a wide range of closely spaced angles. We analyse the effect head pose has upon the features themselves and show that by selecting coefficients with minimum variance w.r.t. pose angle, recognition performance can be improved when train-test pose angles differ. Experiments are conducted using the initial phase of a unique multi view Audio-Visual database designed specifically for research and development of pose-invariant lip-reading systems. We firstly show that it is the higher order horizontal spatial frequency components that become most detrimental as the pose deviates. Secondly we assess the performance of different feature selection masks across a range of pose angles including a new mask based on Minimum Cross-Pose Variance coefficients. We report a relative improvement of 50% in Word Error Rate when using our selection mask over a common energy based selection during profile view lip-reading.
Resumo:
Studies suggest that activation of phosphoinositide 3-kinase-Akt may protect against neuronal cell death in Alzheimer's disease (AD). Here, however, we provide evidence of increased Akt activation, and hyperphosphorylation of critical Akt substrates in AD brain, which link to AD pathogenesis, suggesting that treatments aiming to activate the pathway in AD need to be considered carefully. A different distribution of Akt and phospho-Akt was detected in AD temporal cortex neurons compared with control neurons, with increased levels of active phosphorylated-Akt in particulate fractions, and significant decreases in Akt levels in AD cytosolic fractions, causing increased activation of Akt (phosphorylated-Akt/total Akt ratio) in AD. In concordance, significant increases in the levels of phosphorylation of total Akt substrates, including: GSK3ßSer9, tauSer214, mTORSer2448, and decreased levels of the Akt target, p27kip1, were found in AD temporal cortex compared with controls. A significant loss and altered distribution of the major negative regulator of Akt, PTEN (phosphatase and tensin homologue deleted on chromosome 10), was also detected in AD neurons. Loss of phosphorylated-Akt and PTEN-containing neurons were found in hippocampal CA1 at end stages of AD. Taken together, these results support a potential role for aberrant control of Akt and PTEN signalling in AD.
Resumo:
Background Interferon ? receptor 1 (IFN? R1) deficiency is a primary immunodeficiency with allelic dominant and recessive mutations characterised clinically by severe infections with mycobacteria. We aimed to compare the clinical features of recessive and dominant IFN?R1 deficiencies. Methods We obtained data from a large cohort of patients worldwide. We assessed these people by medical histories, records, and genetic and immunological studies. Data were abstracted onto a standard form. Findings We identified 22 patients with recessive complete IFN?R1 deficiency and 38 with dominant partial deficiency. BCG and environmental mycobacteria were the most frequent pathogens. In recessive patients, 17 (77%) had environmental mycobacterial disease and all nine BCG-vaccinated patients had BCG disease. In dominant patients, 30 (79%) had environmental mycobacterial disease and 11 (73%) of 15 BCG-vaccinated patients had BCG disease. Compared with dominant patients, those with recessive deficiency were younger at onset of first environmental mycobacterial disease (mean 3·1 years [SD 2·5] vs 13·4 years [14·3], p=0·001), had more mycobacterial disease episodes (19 vs 8 per 100 person-years of observation, p=0·0001), had more severe mycobacterial disease (mean number of organs infected by Mycobacterium avium complex 4·1 [SD 0·8] vs 2·0 [1·1], p=0·004), had shorter mean disease-free intervals (1·6 years [SD 1·4] vs 7·2 years [7·6], p
Resumo:
Grey Level Co-occurrence Matrix (GLCM), one of the best known tool for texture analysis, estimates image properties related to second-order statistics. These image properties commonly known as Haralick texture features can be used for image classification, image segmentation, and remote sensing applications. However, their computations are highly intensive especially for very large images such as medical ones. Therefore, methods to accelerate their computations are highly desired. This paper proposes the use of programmable hardware to accelerate the calculation of GLCM and Haralick texture features. Further, as an example of the speedup offered by programmable logic, a multispectral computer vision system for automatic diagnosis of prostatic cancer has been implemented. The performance is then compared against a microprocessor based solution.