25 results for multi-modal interaction

in CentAUR: Central Archive University of Reading - UK


Relevance:

100.00%

Publisher:

Abstract:

Context-aware multimodal interactive systems aim to adapt to the needs and behavioural patterns of users and offer a way forward for enhancing the efficacy and quality of experience (QoE) in human-computer interaction. The various modalities that contribute to such systems each provide a specific uni-modal response; these responses are presented together as a multi-modal interface capable of interpreting multi-modal user input and responding to it appropriately through dynamically adapted multi-modal interactive flow management. This paper presents an initial background study in the context of the first phase of a PhD research programme in the area of optimisation of data fusion techniques to serve multimodal interactive systems, their applications and requirements.

Relevance:

100.00%

Publisher:

Abstract:

Since the advent of the internet in everyday life in the 1990s, the barriers to producing, distributing and consuming multimedia data such as videos, music and ebooks have steadily been lowered for most computer users, so that almost everyone with internet access can join the online communities who produce, consume and, of course, share media artefacts. Along with this trend, the violation of personal data privacy and copyright has increased, with illegal file sharing being rampant across many online communities, particularly for certain music genres and amongst the younger age groups. This has had a devastating effect on the traditional media distribution market, in most cases leaving the distribution companies and the content owner with huge financial losses. To prove that a copyright violation has occurred, one can deploy fingerprinting mechanisms to uniquely identify the property. However, this is currently based only on uni-modal approaches. In this paper we describe some of the design challenges and architectural approaches to multi-modal fingerprinting currently being examined for evaluation studies within a PhD research programme on optimisation of multi-modal fingerprinting architectures. Accordingly, we outline the available modalities that are being integrated through this research programme, which aims to establish the optimal architecture for multi-modal media security protection over the internet as the online distribution environment for both legal and illegal distribution of media products.

Relevance:

100.00%

Publisher:

Abstract:

Fingerprinting is a well-known approach for identifying multimedia data without having the original data present, using instead what amounts to its essence or "DNA". Current approaches show insufficient deployment of three types of knowledge that could be brought to bear in providing a fingerprinting framework that remains effective and efficient and can accommodate protection of both the whole and its elements at appropriate levels of abstraction to suit various Foci of Interest (FoI) in an image or cross-media artefact. Thus our proposed framework aims to deliver selective composite fingerprinting that remains responsive to the requirements for protection of the whole or of parts of an image which may be of particular interest and be especially vulnerable to attempts at rights violation. This is powerfully aided by leveraging both multi-modal information and a rich spectrum of collateral context knowledge, including both image-level collaterals and the inevitably needed market intelligence knowledge, such as profiling of customers' social network interests, which we can deploy as a crucial component of our Fingerprinting Collateral Knowledge. This is used in selecting the special FoIs within an image or other media content that have to be selectively and collaterally protected.

Relevance:

100.00%

Publisher:

Abstract:

Fingerprinting is a well-known approach for identifying multimedia data without having the original data present, using instead what amounts to its essence or 'DNA'. Current approaches show insufficient deployment of various types of knowledge that could be brought to bear in providing a fingerprinting framework that remains effective and efficient and can accommodate protection of both the whole and its elements at appropriate levels of abstraction to suit various Zones of Interest (ZoI) in an image or cross-media artefact. The proposed framework aims to deliver selective composite fingerprinting that is powerfully aided by leveraging both multi-modal information and a rich spectrum of collateral context knowledge, including both image-level collaterals and the inevitably needed market intelligence knowledge, such as profiling of customers' social network interests, which we can deploy as a crucial component of our fingerprinting collateral knowledge.

Relevance:

100.00%

Publisher:

Abstract:

Awareness of emerging situations in a dynamic operational environment of a robotic assistive device is an essential capability of such a cognitive system, based on its effective and efficient assessment of the prevailing situation. This allows the system to interact with the environment in a sensible (semi)autonomous / pro-active manner without the need for frequent interventions from a supervisor. In this paper, we report a novel generic Situation Assessment Architecture for robotic systems directly assisting humans as developed in the CORBYS project. This paper presents the overall architecture for situation assessment and its application in proof-of-concept Demonstrators as developed and validated within the CORBYS project. These include a robotic human follower and a mobile gait rehabilitation robotic system. We present an overview of the structure and functionality of the Situation Assessment Architecture for robotic systems with results and observations as collected from initial validation on the two CORBYS Demonstrators.

Relevance:

90.00%

Publisher:

Abstract:

Methods of approaching the study of discourse have developed rapidly in the last ten years, influenced by a growing interdisciplinary spirit among linguistics and anthropology, sociology, cognitive and cultural psychology and cultural studies, as well as among established sub-fields within linguistics itself. Among the more recent developments are an increasing ‘critical’ turn in discourse analysis, a growing interest in historical, ethnographic and corpus-based approaches to discourse, more concern with the social contexts in which discourse occurs, the social actions that it is used to take and the identities that are constructed through it, as well as a revaluation of what counts as ‘discourse’ to include multi-modal texts and interaction. Advances in Discourse Studies brings together contributions from leading scholars in the field, investigating the historical and theoretical relationships between new advances in discourse studies and pointing towards new directions for the future of the discipline. Featuring discussion questions, classroom projects and recommended readings at the end of each section, as well as case studies illustrating each approach discussed, this is an invaluable resource for students of interdisciplinary discourse analysis.

Relevance:

80.00%

Publisher:

Abstract:

We present results from fast-response wind measurements within and above a busy intersection between two street canyons (Marylebone Road and Gloucester Place) in Westminster, London taken as part of the DAPPLE (Dispersion of Air Pollution and Penetration into the Local Environment; www.dapple.org.uk) 2007 field campaign. The data reported here were collected using ultrasonic anemometers on the roof-top of a building adjacent to the intersection and at two heights on a pair of lamp-posts on opposite sides of the intersection. Site characteristics, data analysis and the variation of intersection flow with the above-roof wind direction (θref) are discussed. Evidence of both flow channelling and recirculation was identified within the canyon, only a few metres from the intersection for along-street and across-street roof-top winds respectively. Results also indicate that for oblique rooftop flows, the intersection flow is a complex combination of bifurcated channelled flows, recirculation and corner vortices. Asymmetries in local building geometry around the intersection and small changes in the background wind direction (changes in 15-min mean θref of 5–10 degrees) were also observed to have profound influences on the behaviour of intersection flow patterns. Consequently, short time-scale variability in the background flow direction can lead to highly scattered in-street mean flow angles masking the true multi-modal features of the flow and thus further complicating modelling challenges.
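As a side note on the 15-minute mean wind directions mentioned above: directions are angles, so a plain arithmetic mean breaks down near the 0/360 degree wrap. The sketch below shows one common way to average directions (a vector or circular mean). It is an illustrative assumption only; the abstract does not state how the campaign computed its means, and the function name `circular_mean_deg` is hypothetical.

```python
import math


def circular_mean_deg(directions_deg):
    """Circular (vector) mean of wind directions given in degrees.

    Each direction is treated as a unit vector; the mean direction is the
    angle of the summed vector, mapped into [0, 360).
    """
    sin_sum = sum(math.sin(math.radians(d)) for d in directions_deg)
    cos_sum = sum(math.cos(math.radians(d)) for d in directions_deg)
    return math.degrees(math.atan2(sin_sum, cos_sum)) % 360.0


# Two samples straddling north: the arithmetic mean would give 180 degrees,
# while the circular mean stays close to north (0 or 360 degrees).
north_mean = circular_mean_deg([350.0, 10.0])
east_mean = circular_mean_deg([80.0, 100.0])
```

A unit-vector mean ignores wind speed; weighting each vector by speed is another option, depending on what the analysis needs.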

Relevance:

80.00%

Publisher:

Abstract:

Eye gaze is an important conversational resource that until now could only be supported across a distance if people were rooted to the spot. We introduce EyeCVE, the world's first tele-presence system that allows people in different physical locations not only to see what each other are doing but also to follow each other's eyes, even when walking about. Projected into each space are avatar representations of remote participants that reproduce not only body, head and hand movements, but also those of the eyes. Spatial and temporal alignment of remote spaces allows the focus of gaze, as well as activity and gesture, to be used as a resource for non-verbal communication. The temporal challenge met was to reproduce eye movements quickly enough and often enough to interpret their focus during a multi-way interaction, along with communicating other verbal and non-verbal language. The spatial challenge met was to maintain communicational eye gaze while allowing free movement of participants within a virtually shared common frame of reference. This paper reports on the technical and especially temporal characteristics of the system.

Relevance:

80.00%

Publisher:

Abstract:

Many techniques are currently used for motion estimation. In block-based approaches, the most common procedure is block matching, based on various algorithms. To refine the motion estimates resulting from the full search or any coarse search algorithm, one can find few applications of Kalman filtering, mainly in the intraframe scheme. The applicability of the Kalman filtering technique to block-based motion estimation is rather limited due to discontinuities in the dynamic behaviour of the motion vectors. Therefore, we propose an application of the concept of filtering by approximated densities (FAD). The FAD, originally introduced to alleviate limitations due to conventional Kalman modelling, is applied to interframe block-motion estimation. This application uses a simple form of FAD involving statistical characteristics of multi-modal distributions up to second order.
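To make the full-search block matching mentioned above concrete, here is a minimal sketch using NumPy. It covers only the coarse matching stage that a filter such as Kalman or FAD would later refine; it is not the paper's method. The function name `full_search_block_match`, the sum-of-absolute-differences (SAD) cost, and the block and search sizes are assumptions chosen for illustration.

```python
import numpy as np


def full_search_block_match(prev, curr, top, left, block=8, radius=4):
    """Exhaustively search `prev` for the best match to one block of `curr`.

    Returns ((dy, dx), sad): the displacement from the block's position in
    `curr` to its best match in `prev`, and the matching cost (SAD).
    """
    target = curr[top:top + block, left:left + block].astype(np.int64)
    best, best_sad = (0, 0), None
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y, x = top + dy, left + dx
            # Skip candidate blocks that fall outside the previous frame.
            if y < 0 or x < 0 or y + block > prev.shape[0] or x + block > prev.shape[1]:
                continue
            cand = prev[y:y + block, x:x + block].astype(np.int64)
            sad = int(np.abs(target - cand).sum())
            if best_sad is None or sad < best_sad:
                best, best_sad = (dy, dx), sad
    return best, best_sad


# Synthetic frames: the current frame is the previous one shifted
# down by 2 rows and left by 1 column.
rng = np.random.default_rng(0)
prev = rng.integers(0, 256, size=(32, 32), dtype=np.uint8)
curr = np.roll(prev, shift=(2, -1), axis=(0, 1))
motion, sad = full_search_block_match(prev, curr, top=12, left=12)
```

For this synthetic shift the search recovers the displacement exactly with zero cost. On real video the per-block vectors are noisy and can jump at object boundaries, which is the discontinuity problem the abstract raises for Kalman-style refinement.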

Relevance:

80.00%

Publisher:

Abstract:

This paper examines the biogas innovation system and processes in two farming communities in Davao del Sur, Philippines. Innovation histories were traced through workshops, semi-structured interviews, observations and document analysis. The paper shows that there were diverse innovation actors from both the public and private sectors. Restrictive attitudes and practices resulted in weak and limited interactions among actors. Multi-actor interaction was weak, signifying a lack of innovation actors that focus on creating, developing and strengthening linkages, networks and partnerships. The lack of support in the socio-organisational institutions that constitute the enabling environment within which innovation actors operate may lead to systemic failure.

Relevance:

80.00%

Publisher:

Abstract:

Brain activity can be measured non-invasively with functional imaging techniques. Each pixel in such an image represents a neural mass of about 10^5 to 10^7 neurons. Mean field models (MFMs) approximate their activity by averaging out neural variability while retaining salient underlying features, like neurotransmitter kinetics. However, MFMs incorporating the regional variability, realistic geometry and connectivity of cortex have so far appeared intractable. This lack of biological realism has led to a focus on gross temporal features of the EEG. We address these impediments and showcase a "proof of principle" forward prediction of co-registered EEG/fMRI for a full-size human cortex in a realistic head model with anatomical connectivity, see figure 1. MFMs usually assume homogeneous neural masses, isotropic long-range connectivity and simplistic signal expression to allow rapid computation with partial differential equations. But these approximations are insufficient in particular for the high spatial resolution obtained with fMRI, since different cortical areas vary in their architectonic and dynamical properties, have complex connectivity, and can contribute non-trivially to the measured signal. Our code instead supports the local variation of model parameters and freely chosen connectivity for many thousand triangulation nodes spanning a cortical surface extracted from structural MRI. This allows the introduction of realistic anatomical and physiological parameters for cortical areas and their connectivity, including both intra- and inter-area connections. Proper cortical folding and conduction through a realistic head model is then added to obtain accurate signal expression for a comparison to experimental data. To showcase the synergy of these computational developments, we predict simultaneously EEG and fMRI BOLD responses by adding an established model for neurovascular coupling and convolving "Balloon-Windkessel" hemodynamics.
We also incorporate regional connectivity extracted from the CoCoMac database [1]. Importantly, these extensions can be easily adapted according to future insights and data. Furthermore, while our own simulation is based on one specific MFM [2], the computational framework is general and can be applied to models favored by the user. Finally, we provide a brief outlook on improving the integration of multi-modal imaging data through iterative fits of a single underlying MFM in this realistic simulation framework.

Relevance:

80.00%

Publisher:

Abstract:

Most prominent models of bilingual representation assume a degree of interconnection or shared representation at the conceptual level. However, in the context of linguistic and cultural specificity of human concepts, and given recent findings that reveal a considerable amount of bidirectional conceptual transfer and conceptual change in bilinguals, a particular challenge that bilingual models face is to account for non-equivalence or partial equivalence of L1- and L2-specific concepts in the bilingual conceptual store. The aim of the current paper is to provide a state-of-the-art review of the available empirical evidence from the fields of psycholinguistics and cognitive, experimental and cross-cultural psychology, and to discuss how this evidence may inform and further develop traditional and more recent accounts of bilingual conceptual representation. Based on a synthesis of the available evidence against theoretical postulates of existing models, I argue that the most coherent account of bilingual conceptual representation combines three fundamental assumptions. The first one is the distributed, multi-modal nature of representation. The second one concerns cross-linguistic and cross-cultural variation of concepts. The third one makes assumptions about the development of concepts, and the emergent links between those concepts and their linguistic instantiations.

Relevance:

80.00%

Publisher:

Abstract:

The feedback mechanism used in a brain-computer interface (BCI) forms an integral part of the closed-loop learning process required for successful operation of a BCI. However, ultimate success of the BCI may be dependent upon the modality of the feedback used. This study explores the use of music tempo as a feedback mechanism in BCI and compares it to the more commonly used visual feedback mechanism. Three different feedback modalities are compared for a kinaesthetic motor imagery BCI: visual, auditory via music tempo, and a combined visual and auditory feedback modality. Visual feedback is provided via the position, on the y-axis, of a moving ball. In the music feedback condition, the tempo of a piece of continuously generated music is dynamically adjusted via a novel music-generation method. All the feedback mechanisms allowed users to learn to control the BCI. However, users were not able to maintain as stable control with the music tempo feedback condition as they could in the visual feedback and combined conditions. Additionally, the combined condition exhibited significantly less inter-user variability, suggesting that multi-modal feedback may lead to more robust results. Finally, common spatial patterns are used to identify participant-specific spatial filters for each of the feedback modalities. The mean optimal spatial filter obtained for the music feedback condition is observed to be more diffuse and weaker than the mean spatial filters obtained for the visual and combined feedback conditions.
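For readers unfamiliar with common spatial patterns (CSP), the sketch below shows the textbook two-class formulation with NumPy: whiten the sum of the two class covariances, then diagonalise one class in the whitened space. This is a generic illustration on synthetic data, not the study's pipeline; the function name `csp_filters`, the trace normalisation, and the data shapes are assumptions.

```python
import numpy as np


def csp_filters(trials_a, trials_b, n_pairs=1):
    """Two-class CSP. trials_*: arrays of shape (n_trials, n_channels, n_samples).

    Returns (filters, eigenvalues). Each row of `filters` is a spatial filter.
    An eigenvalue near 1 means the filter's output variance is dominated by
    class A; an eigenvalue near 0 means it is dominated by class B.
    """
    def mean_cov(trials):
        covs = []
        for x in trials:
            x = x - x.mean(axis=1, keepdims=True)  # remove per-channel mean
            c = x @ x.T
            covs.append(c / np.trace(c))           # trace-normalised covariance
        return np.mean(covs, axis=0)

    cov_a, cov_b = mean_cov(trials_a), mean_cov(trials_b)
    # Whitening transform for the composite covariance cov_a + cov_b.
    evals, evecs = np.linalg.eigh(cov_a + cov_b)
    whitener = np.diag(evals ** -0.5) @ evecs.T
    # In whitened space the two class covariances share eigenvectors.
    d, u = np.linalg.eigh(whitener @ cov_a @ whitener.T)  # ascending order
    w = u.T @ whitener
    picks = list(range(n_pairs)) + list(range(len(d) - n_pairs, len(d)))
    return w[picks], d[picks]


# Synthetic data: channel 0 is strong in class A, channel 2 in class B.
rng = np.random.default_rng(1)
scale_a = np.array([3.0, 1.0, 1.0])[:, None]
scale_b = np.array([1.0, 1.0, 3.0])[:, None]
trials_a = rng.standard_normal((20, 3, 200)) * scale_a
trials_b = rng.standard_normal((20, 3, 200)) * scale_b
filters, eigs = csp_filters(trials_a, trials_b)
```

In this toy case the filter with the largest eigenvalue weights channel 0 most heavily. A more diffuse filter, as reported above for the music feedback condition, would instead spread its weight across many channels.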

Relevance:

80.00%

Publisher:

Abstract:

Characterization of neural and hemodynamic biomarkers of epileptic activity that can be measured using noninvasive techniques is fundamental to the accurate identification of the epileptogenic zone (EZ) in the clinical setting. Recently, oscillations at gamma-band frequencies and above (>30 Hz) have been suggested to provide valuable localizing information of the EZ and track cortical activation associated with epileptogenic processes. Although a tight coupling between gamma-band activity and hemodynamic-based signals has been consistently demonstrated in non-pathological conditions, very little is known about whether such a relationship is maintained in epilepsy and the laminar etiology of these signals. Confirmation of this relationship may elucidate the underpinnings of perfusion-based signals in epilepsy and the potential value of localizing the EZ using hemodynamic correlates of pathological rhythms. Here, we use concurrent multi-depth electrophysiology and 2-dimensional optical imaging spectroscopy to examine the coupling between multi-band neural activity and cerebral blood volume (CBV) during recurrent acute focal neocortical seizures in the urethane-anesthetized rat. We show a powerful correlation between gamma-band power (25–90 Hz) and CBV across cortical laminae, in particular layer 5, and a close association between gamma measures and multi-unit activity (MUA). Our findings provide insights into the laminar electrophysiological basis of perfusion-based imaging signals in the epileptic state and may have implications for further research using non-invasive multi-modal techniques to localize epileptogenic tissue.