791 resultados para audio recording


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Audio feedback remains little used in most graphical user interfaces despite its potential to greatly enhance interaction. Not only does sonic enhancement of interfaces permit more natural human-computer communication but it also allows users to employ an appropriate sense to solve a problem rather than having to rely solely on vision. Research shows that designers do not typically know how to use sound effectively; subsequently, their ad hoc use of sound often leads to audio feedback being considered an annoying distraction. Unlike the design of purely graphical user interfaces for which guidelines are common, the audio-enhancement of graphical user interfaces has (until now) been plagued by a lack of suitable guidance. This paper presents a series of empirically substantiated guidelines for the design and use of audio-enhanced graphical user interface widgets.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A newly released commercial autorefractor, the Shin-Nippon SRW-5000 (Japan), has been found to be valid compared to subjective refraction and repeatable over a wide prescription range. Its binocular open field-of-view allows the accommodative state to be monitored while a natural environment is viewed. In conventional static mode, the device can take up to 45 readings in 1min using digital image analysis of the reflected retinal image of a measurement ring. Continuous on-line analysis of the ring provides high (up to 60Hz) temporal resolution of the refractive state to an accuracy of <0.001D. Pupil size can also be analysed to a resolution of <0.001mm. The measurement of accommodation and pupil size was relatively unaffected by eccentricity of viewing up to ±10° and instrument focusing inaccuracies of ±5mm. The resolution properties of the analysis are shown to be ideal for measurement of dynamic accommodation and pupil responses. Copyright © 2001 The College of Optometrists.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The paper presents our considerations related to the creation of a digital corpus of Bulgarian dialects. The dialectological archive of Bulgarian language consists of more than 250 audio tapes. All tapes were recorded between 1955 and 1965 in the course of regular dialectological expeditions throughout the country. The records typically contain interviews with inhabitants of small villages in Bulgaria. The topics covered are usually related to such issues as birth, everyday life, marriage, family relationship, death, etc. Only a few tapes contain folk songs from different regions of the country. Taking into account the progressive deterioration of the magnetic media and the realistic prospects of data loss, the Institute for Bulgarian Language at the Academy of Sciences launched in 1997 a project aiming at restoration and digital preservation of the dialectological archive. Within the framework of this project more than the half of the records was digitized, de-noised and stored on digital recording media. Since then restoration and digitization activities are done in the Institute on a regular basis. As a result a large collection of sound files has been gathered. Our further efforts are aimed at the creation of a digital corpus of Bulgarian dialects, which will be made available for phonological and linguistic research. Such corpora typically include besides the sound files two basic elements: a transcription, aligned with the sound file, and a set of standardized metadata that defines the corpus. In our work we will present considerations on how these tasks could be realized in the case of the corpus of Bulgarian dialects. Our suggestions will be based on a comparative analysis of existing methods and techniques to build such corpora, and by selecting the ones that fit closer to the particular needs. Our experience can be used in similar institutions storing folklore archives, history related spoken records etc.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this report we summarize the state-of-the-art of speech emotion recognition from the signal processing point of view. On the bases of multi-corporal experiments with machine-learning classifiers, the observation is made that existing approaches for supervised machine learning lead to database dependent classifiers which can not be applied for multi-language speech emotion recognition without additional training because they discriminate the emotion classes following the used training language. As there are experimental results showing that Humans can perform language independent categorisation, we made a parallel between machine recognition and the cognitive process and tried to discover the sources of these divergent results. The analysis suggests that the main difference is that the speech perception allows extraction of language independent features although language dependent features are incorporated in all levels of the speech signal and play as a strong discriminative function in human perception. Based on several results in related domains, we have suggested that in addition, the cognitive process of emotion-recognition is based on categorisation, assisted by some hierarchical structure of the emotional categories, existing in the cognitive space of all humans. We propose a strategy for developing language independent machine emotion recognition, related to the identification of language independent speech features and the use of additional information from visual (expression) features.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Aims: To survey eye care practitioners from around the world regarding their current practice for anterior eye health recording to inform guidelines on best practice. Methods: The on-line survey examined the reported use of: word descriptions, sketching, grading scales or photographs; paper or computerised record cards and whether these were guided by proforma headings; grading scale choice, signs graded, level of precision, regional grading; and how much time eye care practitioners spent on average on anterior eye health recording. Results: Eight hundred and nine eye care practitioners from across the world completed the survey. Word description (p <. 0.001), sketches (p = 0.002) and grading scales (p <. 0.001) were used more for recording the anterior eye health of contact lens patients than other patients, but photography was used similarly (p = 0.132). Of the respondents, 84.5% used a grading scale, 13.5% using two, with the original Efron (51.6%) and CCLRU/Brien-Holden-Vision-Institute (48.5%) being the most popular. The median features graded was 11 (range 1-23), frequency from 91.6% (bulbar hyperaemia) to 19.6% (endothelial blebs), with most practitioners grading to the nearest unit (47.4%) and just 14.7% to one decimal place. The average time taken to report anterior eye health was reported to be 6.8. ±. 5.7. min, with the maximum time available 14.0. ±. 11. min. Conclusions: Developed practice and research evidence allows best practice guidelines for anterior eye health recording to be recommended. It is recommended to: record which grading scale is used; always grade to one decimal place, record what you see live rather than based on how you intend to manage a condition; grade bulbar and limbal hyperaemia, limbal neovascularisation, conjunctival papillary redness and roughness (in white light to assess colouration with fluorescein instilled to aid visualisation of papillae/follicles), blepharitis, meibomian gland dysfunction and sketch staining (both corneal and conjunctival) at every visit. Record other anterior eye features only if they are remarkable, but indicate that the key tissue which have been examined.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

AMS Subj. Classification: H.3.7 Digital Libraries, K.6.5 Security and Protection

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Long term recording of biomedical signals such as ECG, EMG, respiration and other information (e.g. body motion) can improve diagnosis and potentially monitor the evolution of many widespread diseases. However, long term monitoring requires specific solutions, portable and wearable equipment that should be particularly comfortable for patients. The key-issues of portable biomedical instrumentation are: power consumption, long-term sensor stability, comfortable wearing and wireless connectivity. In this scenario, it would be valuable to realize prototypes using available technologies to assess long-term personal monitoring and foster new ways to provide healthcare services. The aim of this work is to discuss the advantages and the drawbacks in long term monitoring of biopotentials and body movements using textile electrodes embedded in clothes. The textile electrodes were embedded into garments; tiny shirt and short were used to acquire electrocardiographic and electromyographic signals. The garment was equipped with low power electronics for signal acquisition and data wireless transmission via Bluetooth. A small, battery powered, biopotential amplifier and three-axes acceleration body monitor was realized. Patient monitor incorporates a microcontroller, analog-to-digital signal conversion at programmable sampling frequencies. The system was able to acquire and to transmit real-time signals, within 10 m range, to any Bluetooth device (including PDA or cellular phone). The electronics were embedded in the shirt resulting comfortable to wear for patients. Small size MEMS 3-axes accelerometers were also integrated. © 2011 IEEE.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In the digital age the internet and the ICT devices changed our daily life and routines. It means we couldn't live without these services and devices anywhere (work, home, holiday, etc.). It can be experienced in the tourism sector; digital contents become key tools in the tourism of the 21st century; they will be able to adapt the traditional tourist guide methodology to the applications running on novel digital devices. Tourists belong to a new generation, an "ICT generation" using innovative tools, a new info-media to communicate. A possible direction for tourism development is to use modern ICT systems and devices. Besides participating in classical tours guided by travel guides, there is a new opportunity for individual tourists to enjoy high quality ICT based guided walks prepared on the knowledge of travel guides. The main idea of the GUIDE@HAND service is to use reusable, and create new tourism contents for an advanced mobile device, in order to give a contemporary answer to traditional systems of tourism information, by developing new tourism services based on digital contents for innovative mobile applications. The service is based on a new concept of enhancing territorial heritage and values, through knowledge, innovation, languages and multilingual solutions going along with new tourists‟ “sensitiveness”.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

It is well established that accent recognition can be as accurate as up to 95% when the signals are noise-free, using feature extraction techniques such as mel-frequency cepstral coefficients and binary classifiers such as discriminant analysis, support vector machine and k-nearest neighbors. In this paper, we demonstrate that the predictive performance can be reduced by as much as 15% when the signals are noisy. Specifically, in this paper we perturb the signals with different levels of white noise, and as the noise become stronger, the out-of-sample predictive performance deteriorates from 95% to 80%, although the in-sample prediction gives overly-optimistic results. ACM Computing Classification System (1998): C.3, C.5.1, H.1.2, H.2.4., G.3.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper we discuss how an innovative audio-visual project was adopted to foster active, rather than declarative learning, in critical International Relations (IR). First, we explore the aesthetic turn in IR, to contrast this with forms of representation that have dominated IR scholarship. Second, we describe how students were asked to record short audio or video projects to explore their own insights through aesthetic and non-written formats. Third, we explain how these projects are understood to be deeply embedded in social science methodologies. We cite our inspiration from applying a personal sociological imagination, as a way to counterbalance a ‘marketised’ slant in higher education, in a global economy where students are often encouraged to consume, rather than produce knowledge. Finally, we draw conclusions in terms of deeper forms of student engagement leading to new ways of thinking and presenting new skills and new connections between theory and practice.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This work is the first work using patterned soft underlayers in multilevel three-dimensional vertical magnetic data storage systems. The motivation stems from an exponentially growing information stockpile, and a corresponding need for more efficient storage devices with higher density. The world information stockpile currently exceeds 150EB (ExaByte=1x1018Bytes); most of which is in analog form. Among the storage technologies (semiconductor, optical and magnetic), magnetic hard disk drives are posed to occupy a big role in personal, network as well as corporate storage. However; this mode suffers from a limit known as the Superparamagnetic limit; which limits achievable areal density due to fundamental quantum mechanical stability requirements. There are many viable techniques considered to defer superparamagnetism into the 100's of Gbit/in2 such as: patterned media, Heat-Assisted Magnetic Recording (HAMR), Self Organized Magnetic Arrays (SOMA), antiferromagnetically coupled structures (AFC), and perpendicular magnetic recording. Nonetheless, these techniques utilize a single magnetic layer; and can thusly be viewed as two-dimensional in nature. In this work a novel three-dimensional vertical magnetic recording approach is proposed. This approach utilizes the entire thickness of a magnetic multilayer structure to store information; with potential areal density well into the Tbit/in2 regime. ^ There are several possible implementations for 3D magnetic recording; each presenting its own set of requirements, merits and challenges. The issues and considerations pertaining to the development of such systems will be examined, and analyzed using empirical and numerical analysis techniques. Two novel key approaches are proposed and developed: (1) Patterned soft underlayer (SUL) which allows for enhanced recording of thicker media, (2) A combinatorial approach for 3D media development that facilitates concurrent investigation of various film parameters on a predefined performance metric. A case study is presented using combinatorial overcoats of Tantalum and Zirconium Oxides for corrosion protection in magnetic media. ^ Feasibility of 3D recording is demonstrated, and an emphasis on 3D media development is emphasized as a key prerequisite. Patterned SUL shows significant enhancement over conventional "un-patterned" SUL, and shows that geometry can be used as a design tool to achieve favorable field distribution where magnetic storage and magnetic phenomena are involved. ^

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Digital systems can generate left and right audio channels that create the effect of virtual sound source placement (spatialization) by processing an audio signal through pairs of Head-Related Transfer Functions (HRTFs) or, equivalently, Head-Related Impulse Responses (HRIRs). The spatialization effect is better when individually-measured HRTFs or HRIRs are used than when generic ones (e.g., from a mannequin) are used. However, the measurement process is not available to the majority of users. There is ongoing interest to find mechanisms to customize HRTFs or HRIRs to a specific user, in order to achieve an improved spatialization effect for that subject. Unfortunately, the current models used for HRTFs and HRIRs contain over a hundred parameters and none of those parameters can be easily related to the characteristics of the subject. This dissertation proposes an alternative model for the representation of HRTFs, which contains at most 30 parameters, all of which have a defined functional significance. It also presents methods to obtain the value of parameters in the model to make it approximately equivalent to an individually-measured HRTF. This conversion is achieved by the systematic deconstruction of HRIR sequences through an augmented version of the Hankel Total Least Squares (HTLS) decomposition approach. An average 95% match (fit) was observed between the original HRIRs and those re-constructed from the Damped and Delayed Sinusoids (DDSs) found by the decomposition process, for ipsilateral source locations. The dissertation also introduces and evaluates an HRIR customization procedure, based on a multilinear model implemented through a 3-mode tensor, for mapping of anatomical data from the subjects to the HRIR sequences at different sound source locations. This model uses the Higher-Order Singular Value Decomposition (HOSVD) method to represent the HRIRs and is capable of generating customized HRIRs from easily attainable anatomical measurements of a new intended user of the system. Listening tests were performed to compare the spatialization performance of customized, generic and individually-measured HRIRs when they are used for synthesized spatial audio. Statistical analysis of the results confirms that the type of HRIRs used for spatialization is a significant factor in the spatialization success, with the customized HRIRs yielding better results than generic HRIRs.