31 results for Audio-visual Speech Recognition, Visual Feature Extraction, Free-parts, Monolithic, ROI
at Université de Lausanne, Switzerland
Abstract:
Modern cochlear implantation technologies allow deaf patients to understand auditory speech; however, the implants deliver only a coarse auditory input and patients must use long-term adaptive processes to achieve coherent percepts. In adults with post-lingual deafness, the greatest progress in speech recovery is observed during the first year after cochlear implantation, but there is wide variability in the level of cochlear implant outcomes and in the temporal evolution of recovery. It has been proposed that when profoundly deaf subjects receive a cochlear implant, the visual cross-modal reorganization of the brain is deleterious for auditory speech recovery. We tested this hypothesis in post-lingually deaf adults by analysing whether brain activity shortly after implantation correlated with the level of auditory recovery 6 months later. Based on brain activity induced by a speech-processing task, we found strong positive correlations in areas outside the auditory cortex. The highest positive correlations were found in the occipital cortex involved in visual processing, as well as in the posterior-temporal cortex known for audio-visual integration. The other area that positively correlated with auditory speech recovery was localized in the left inferior frontal area known for speech processing. Our results demonstrate that the visual modality's functional level is related to the proficiency level of auditory recovery. Based on the positive correlation of visual activity with auditory speech recovery, we suggest that the visual modality may facilitate the perception of the word's auditory counterpart in communicative situations. The link demonstrated between visual activity and auditory speech perception indicates that visuoauditory synergy is crucial for cross-modal plasticity and for fostering speech-comprehension recovery in adult cochlear-implanted deaf patients.
Abstract:
Understanding the basis on which recruiters form hirability impressions for a job applicant is a key issue in organizational psychology and can be addressed as a social computing problem. We approach the problem from a face-to-face, nonverbal perspective where behavioral feature extraction and inference are automated. This paper presents a computational framework for the automatic prediction of hirability. To this end, we collected an audio-visual dataset of real job interviews where candidates were applying for a marketing job. We automatically extracted audio and visual behavioral cues related to both the applicant and the interviewer. We then evaluated several regression methods for the prediction of hirability scores and showed the feasibility of conducting such a task, with ridge regression explaining 36.2% of the variance. Feature groups were analyzed, and two main groups of behavioral cues were predictive of hirability: applicant audio features and interviewer visual cues, showing the predictive validity of cues related not only to the applicant, but also to the interviewer. As a last step, we analyzed the predictive validity of psychometric questionnaires often used in the personnel selection process, and found that these questionnaires were unable to predict hirability, suggesting that hirability impressions were formed based on the interaction during the interview rather than on questionnaire data.
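The "variance explained" figure reported for ridge regression corresponds to a coefficient of determination (R² = 0.362). The following is a minimal sketch of ridge regression and R² in plain Python, not the authors' pipeline: the data are synthetic, and the names `solve`, `ridge_fit`, `predict` and `r_squared` are hypothetical helpers introduced only for illustration. The abstract does not say how the score was evaluated; in practice R² would be computed on held-out interviews rather than on the training data used here.

```python
def solve(A, rhs):
    """Solve A x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    M = [row[:] + [rhs[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def ridge_fit(X, y, alpha):
    """Minimise ||y - Xw - b||^2 + alpha * ||w||^2 (intercept b unpenalised)."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    Xc = [[row[j] - x_mean[j] for j in range(p)] for row in X]
    yc = [v - y_mean for v in y]
    # Normal equations on centred data: (Xc^T Xc + alpha I) w = Xc^T yc
    A = [[sum(Xc[i][j] * Xc[i][k] for i in range(n)) + (alpha if j == k else 0.0)
          for k in range(p)] for j in range(p)]
    rhs = [sum(Xc[i][j] * yc[i] for i in range(n)) for j in range(p)]
    w = solve(A, rhs)
    intercept = y_mean - sum(w[j] * x_mean[j] for j in range(p))
    return w, intercept


def predict(X, w, intercept):
    return [sum(wj * xj for wj, xj in zip(w, row)) + intercept for row in X]


def r_squared(y_true, y_pred):
    """Fraction of the variance of y_true explained by y_pred."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Synthetic example (not interview data): y = 2*x1 - x2 + 3 exactly.
X = [[1.0, 2.0], [2.0, 1.0], [3.0, 4.0], [4.0, 3.0], [5.0, 5.0]]
y = [3.0, 6.0, 5.0, 8.0, 8.0]
w0, b0 = ridge_fit(X, y, alpha=0.0)    # no penalty: ordinary least squares
w1, b1 = ridge_fit(X, y, alpha=10.0)   # penalty shrinks the weights
print(r_squared(y, predict(X, w0, b0)), r_squared(y, predict(X, w1, b1)))
```

The penalty `alpha` trades goodness of fit for smaller weights, which is the usual motivation for ridge regression when many correlated behavioural cues are available for few interviews.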
Abstract:
It has been demonstrated in earlier studies that patients with a cochlear implant have increased abilities for audio-visual integration because the crude information transmitted by the cochlear implant requires the persistent use of the complementary speech information from the visual channel. The brain network for these abilities needs to be clarified. We used an independent component analysis (ICA) of the activation (H₂¹⁵O) positron emission tomography data to explore occipito-temporal brain activity in post-lingually deaf patients with unilaterally implanted cochlear implants at several months post-implantation (T1), shortly after implantation (T0) and in normal hearing controls. In between-group analysis, patients at T1 had greater blood flow in the left middle temporal cortex as compared with T0 and normal hearing controls. In within-group analysis, patients at T0 had a task-related ICA component in the visual cortex, and patients at T1 had one task-related ICA component in the left middle temporal cortex and the other in the visual cortex. The time courses of temporal and visual activities during the positron emission tomography examination at T1 were highly correlated, meaning that synchronized integrative activity occurred. The greater involvement of the visual cortex and its close coupling with the temporal cortex at T1 confirm the importance of audio-visual integration in more experienced cochlear implant subjects at the cortical level.
Abstract:
ABSTRACT This thesis is composed of two main parts. The first addressed the question of whether the auditory and somatosensory systems, like their visual counterpart, comprise parallel functional pathways for processing identity and spatial attributes (so-called 'what' and 'where' pathways, respectively). The second part examined the independence of control processes mediating task switching across 'what' and 'where' pathways in the auditory and visual modalities. Concerning the first part, electrical neuroimaging of event-related potentials identified the spatio-temporal mechanisms subserving auditory (see Appendix, Study n°1) and vibrotactile (see Appendix, Study n°2) processing during two types of blocks of trials. 'What' blocks varied stimuli in their frequency independently of their location. 'Where' blocks varied the same stimuli in their location independently of their frequency. Concerning the second part (see Appendix, Study n°3), a psychophysical task-switching paradigm was used to investigate the hypothesis that the efficacy of control processes depends on the extent of overlap between the neural circuitry mediating the different tasks at hand, such that more effective task preparation (and by extension smaller switch costs) is achieved when the anatomical/functional overlap of this circuitry is small. Performance costs associated with switching tasks and/or switching sensory modalities were measured. Tasks required the analysis of either the identity or spatial location of environmental objects ('what' and 'where' tasks, respectively) that were presented either visually or acoustically on any given trial. Pretrial cues informed participants of the upcoming task, but not of the sensory modality. - In the audio-visual domain, the results showed that switch costs between tasks were significantly smaller when the sensory modality of the task switched than when it repeated.
In addition, switch costs between the senses were correlated only when the sensory modality of the task repeated across trials and not when it switched. The collective evidence supports not only the independence of control processes mediating task switching and modality switching, but also the hypothesis that switch costs reflect competitive interference between neural circuits, which in turn can be diminished when these neural circuits are distinct. - In the auditory and somatosensory domains, the findings show that a segregation of location vs. recognition information is observed across sensory systems and that it occurs at around 100 ms for both sensory modalities. - Also, our results show that functionally specialized pathways for audition and somatosensation involve largely overlapping brain regions, i.e. posterior superior and middle temporal cortices and inferior parietal areas. Both these properties (synchrony of differential processing and overlapping brain regions) probably optimize the relationships across sensory modalities. - Therefore, these results may be indicative of a computationally advantageous organization for processing spatial and identity information.
Abstract:
This paper presents general problems and approaches for spatial data analysis using machine learning algorithms. Machine learning is a very powerful approach to adaptive data analysis, modelling and visualisation. The key feature of machine learning algorithms is that they learn from empirical data and can be used in cases where the modelled environmental phenomena are hidden, nonlinear, noisy and highly variable in space and time. Most machine learning algorithms are universal and adaptive modelling tools developed to solve basic problems of learning from data: classification/pattern recognition, regression/mapping and probability density modelling. In the present report some of the widely used machine learning algorithms, namely artificial neural networks (ANN) of different architectures and Support Vector Machines (SVM), are adapted to the problems of the analysis and modelling of geo-spatial data. Machine learning algorithms have an important advantage over traditional models of spatial statistics when problems are considered in high-dimensional geo-feature spaces, that is, when the dimension of the space exceeds 5. Such features are usually generated, for example, from digital elevation models, remote sensing images, etc. An important extension of the models concerns the consideration of real-space constraints such as geomorphology, networks, and other natural structures. Recent developments in semi-supervised learning can improve the modelling of environmental phenomena by taking geo-manifolds into account. An important part of the study deals with the analysis of relevant variables and model inputs. This problem is approached using different nonlinear feature selection/feature extraction tools.
To demonstrate the application of machine learning algorithms several interesting case studies are considered: digital soil mapping using SVM, automatic mapping of soil and water system pollution using ANN; natural hazards risk analysis (avalanches, landslides), assessments of renewable resources (wind fields) with SVM and ANN models, etc. The dimensionality of spaces considered varies from 2 to more than 30. Figures 1, 2, 3 demonstrate some results of the studies and their outputs. Finally, the results of environmental mapping are discussed and compared with traditional models of geostatistics.
Abstract:
Monitoring of posture allocations and activities enables accurate estimation of energy expenditure and may aid in obesity prevention and treatment. At present, accurate devices rely on multiple sensors distributed on the body and thus may be too obtrusive for everyday use. This paper presents a novel wearable sensor that is capable of very accurate recognition of common postures and activities. The patterns of heel acceleration and plantar pressure uniquely characterize postures and typical activities while requiring minimal preprocessing and no feature extraction. The shoe sensor was tested in nine adults performing sitting and standing postures and while walking, running, stair ascent/descent and cycling. Support vector machines (SVMs) were used for classification. A fourfold validation of a six-class subject-independent group model showed 95.2% average accuracy of posture/activity classification on the full sensor set and over 98% on the optimized sensor set. Using a combination of acceleration and pressure also enabled a pronounced reduction of the sampling frequency (25 to 1 Hz) without significant loss of accuracy (98% versus 93%). Subjects had shoe sizes (US) M9.5-11 and W7-9 and body mass index from 18.1 to 39.4 kg/m², suggesting that the device can be used by individuals with varying anthropometric characteristics.
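One common way to obtain a subject-independent estimate is to keep all samples from a given person in the same validation fold, so that the classifier is never tested on someone it was trained on. The sketch below illustrates that idea only. The abstract does not state how the four folds were formed, so the round-robin assignment and the function name `subject_folds` are assumptions made for illustration, and the classifier itself (an SVM in the paper) is left out.

```python
def subject_folds(subject_ids, n_folds=4):
    """Split sample indices into folds so that no subject spans train and test.

    subject_ids[i] is the subject who produced sample i. Subjects are assigned
    to folds round-robin (an assumption; any subject-wise assignment works).
    Returns a list of (train_indices, test_indices) pairs, one per fold.
    """
    subjects = sorted(set(subject_ids))
    fold_of = {s: i % n_folds for i, s in enumerate(subjects)}
    folds = []
    for k in range(n_folds):
        test = [i for i, s in enumerate(subject_ids) if fold_of[s] == k]
        train = [i for i, s in enumerate(subject_ids) if fold_of[s] != k]
        folds.append((train, test))
    return folds


# Hypothetical layout: nine subjects with three recorded samples each.
subject_ids = [s for s in range(1, 10) for _ in range(3)]
folds = subject_folds(subject_ids, n_folds=4)
for train, test in folds:
    held_out = sorted({subject_ids[i] for i in test})
    print(f"train on {len(train)} samples, test on subjects {held_out}")
```

With nine subjects and four folds, the held-out groups contain three, two, two and two subjects; the accuracy for each fold would be averaged to give a figure comparable to the one quoted.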
Abstract:
Data mining can be defined as the extraction of previously unknown and potentially useful information from large datasets. The main principle is to devise computer programs that run through databases and automatically seek deterministic patterns. It is applied in various fields, e.g., remote sensing, biometry and speech recognition, but has seldom been applied to forensic case data. The intrinsic difficulty related to the use of such data lies in its heterogeneity, which comes from the many different sources of information. The aim of this study is to highlight potential uses of pattern recognition that would provide relevant results from a criminal intelligence point of view. The role of data mining within a global crime analysis methodology is to detect all types of structures in a dataset. Once filtered and interpreted, those structures can point to previously unseen criminal activities. The interpretation of patterns for intelligence purposes is the final stage of the process. It allows the researcher to validate the whole methodology and to refine each step if necessary. An application to cutting agents found in illicit drug seizures was performed. A combinatorial approach was taken, using the presence and absence of products. Methods from graph theory were used to extract patterns in data constituted by links between products and the place and date of seizure. A data mining process completed using graphing techniques is called "graph mining". Patterns were detected that had to be interpreted and compared with preliminary knowledge to establish their relevance. The illicit drug profiling process is in fact an intelligence process that uses preliminary illicit drug classes to classify new samples. Methods proposed in this study could be used a priori to compare structures from preliminary and post-detection patterns.
This new knowledge of a repeated structure may provide valuable complementary information to profiling and become a source of intelligence.
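As a rough illustration of the presence/absence and graph-based idea described above, the sketch below links two cutting agents whenever they occur together in enough seizures and then groups linked agents into connected components. It is not the study's method, which the abstract does not detail: the seizure records are invented, the support threshold is arbitrary, and `cooccurrence_edges` and `connected_components` are hypothetical names.

```python
from collections import Counter, defaultdict
from itertools import combinations


def cooccurrence_edges(seizures, min_support=2):
    """Count how often each pair of agents appears in the same seizure.

    Returns {(agent_a, agent_b): count} for pairs seen at least min_support times.
    """
    counts = Counter()
    for agents in seizures:
        for pair in combinations(sorted(set(agents)), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


def connected_components(edges):
    """Group agents that are linked, directly or indirectly, by retained edges."""
    adjacency = defaultdict(set)
    for a, b in edges:
        adjacency[a].add(b)
        adjacency[b].add(a)
    seen, components = set(), []
    for start in sorted(adjacency):
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(adjacency[node] - component)
        seen |= component
        components.append(component)
    return components


# Invented records: each set lists the cutting agents present in one seizure.
seizures = [
    {"caffeine", "paracetamol"},
    {"caffeine", "paracetamol", "lidocaine"},
    {"lidocaine", "phenacetin"},
    {"lidocaine", "phenacetin"},
    {"caffeine"},
]
edges = cooccurrence_edges(seizures, min_support=2)
components = connected_components(edges)
print(edges)
print(components)
```

A recurring component of this kind is only a candidate pattern; as the abstract stresses, it still has to be interpreted against prior knowledge (and, in the study, against place and date of seizure) before it counts as intelligence.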
Abstract:
THESIS ABSTRACT: Stable isotope geochemistry is used to help resolve a large number of geological questions. In order to do this, it is essential to understand the different mechanisms that govern isotopic fractionation processes between different phases and to identify the conditions required to reach equilibrium fractionation. However, at low temperatures, these processes are poorly constrained and many factors can induce differential partitioning of the isotopes between sectors of a mineral species and the fluid during mineral growth. This can result in so-called 'sector zoning' of a mineral species. The aim of this thesis is to evaluate the occurrence of sector zoning of the oxygen isotopes and trace elements in natural α-quartz crystals and to identify the reasons for such zoning. The implications for fluid-mineral interactions are studied in the context of Alpine metamorphism. The approach focused on examining the crystal structure and cathodoluminescence (CL) appearance, and on relating trace element (e.g. Li, Na, Al, P, K, Ca, Ge, Ti, Fe) and stable oxygen isotope compositions between and along different growth sectors. Low-temperature quartz samples were selected from Alpine veins in different localities where growth conditions have already been well constrained. The mineralogy and the isotopic compositions of the host rocks were also investigated in order to interpret the variations obtained between the different growth stages in the framework of fluid-rock interaction during Alpine metamorphism. Depending on the growth conditions, most of the studied quartz is strongly zoned in CL, with corresponding zonation in trace element content (e.g. growth zoning). Aluminium, substituting for Si in the lattice, was found in concentrations of up to several thousand ppma, and its distribution is strongly related to that of Li and H and, to a lesser extent, Ge.
Elemental sector zoning is evident from the distribution of these three elements, since they exhibit differences in their respective concentrations between faces for distinct growth zones, with prismatic faces having the lowest Al contents. Quartz from veins in magmatic rocks, for example, tends to have lower Al concentrations and similar concentrations of Li and Ti, which also suggests a contribution of these elements from the host rock. Al and Li nevertheless remain correlated. Only Alpine crystals grown at higher temperatures (~400°C) without any CL zoning feature are free of these impurities and do not show such zoning characteristics. Differences in the δ18O values were measured between different faces, principally in the Al-enriched growth zones or stages. These results were confirmed by means of two different methods (in situ/non in situ). However, it was determined that the Al concentrations do not significantly affect oxygen isotope fractionation at 300°C. Taken together, the results suggest that sector zoning in quartz crystals is real, but not universal, and should henceforth be taken into consideration for any use of these systems. The occurrence of disequilibrium partitioning has been highlighted and is possibly related to kinetic processes as well as structural effects that do not affect trace element incorporation and isotopic fractionation in the same way. In situ measurements also revealed fine-scale δ18O zonation along growth paths that is useful for constraining fluid-rock interactions during Alpine metamorphism. Variations in the δ18O values along growth vectors indicate changes in the fluid composition and origin. Association with the oxygen isotope composition of the host rock allows the deduction of interactions between rocks, veins and, consequently, fluids, as well as of fluid regimes. THESIS SUMMARY: At low temperatures (i.e.
below 400°C), the various mechanisms that govern isotopic fractionation, as well as the conditions required to reach equilibrium, are poorly known, and many parameters can cause differential chemical partitioning between different sectors of a mineral and the fluid in contact with it. The aim of this thesis is therefore to evaluate the possible presence of sector zoning in oxygen isotopes and in trace elements in natural low-temperature α-quartz crystals, the reasons for such a phenomenon and, finally, its implications for fluid-rock interactions, mainly in the context of Alpine metamorphism. The structure and cathodoluminescence (CL) appearance of the samples were characterized before the trace element (Li, Na, Al, P, K, Ca, Ge, Ti, Fe) and oxygen isotope compositions were traced in detail along and between different sectors. The selected quartz samples come mostly from Alpine veins in different localities where the growth conditions have already been well characterized. The mineralogical and isotopic compositions of the host rock were also examined in order to constrain the variations obtained in an Alpine context. Depending on their growth conditions, most of the crystals studied are strongly zoned, which is underlined by zoning in trace element concentrations (e.g. growth zoning). Aluminium, which can substitute for silicon in the crystal lattice, was found at very high concentrations in certain zones (several thousand ppma). In addition, the Al distribution is strongly related to those of Li and H and, to a lesser extent, to that of Ge.
Sector zoning is evident for these elements, which show large differences in concentration between different faces for the same growth zone, with the lowest concentrations found in the prismatic faces. Quartz from veins in magmatic rocks, for example, has Li and Ti concentrations of the same order of magnitude, confirming the role of the host-rock composition. The Li/Al relationship remains strong, but the ratio depends on the face measured. Only the higher-temperature Alpine crystals (400°C) that have no CL zones show none of these characteristics. Differences in the δ18O values of identical Al-enriched zones were clearly measured between the different faces r, z and m, but also within a single zone, indicating that fractionation probably took place in disequilibrium. It was determined that the presence of Al at these levels had only a weak effect on oxygen isotope fractionation. The use of two different methods yielded concordant in situ and non-in situ results. Comparison of the results demonstrates that sector zoning is indeed present in certain quartz crystals and depends on the conditions of formation. Differential partitioning of trace elements may be due to kinetic as well as structural effects, whereas sector zoning of the oxygen isotopes would have other origins. It is therefore clear that the possibility of sector zoning must henceforth be taken into account before any interpretation of isotopic data from zoned crystals. The in situ measurements also made it possible to distinguish fine variations in δ18O values during growth, which can help to trace fluid circulation in the Alps during this period.
In association with the compositions of the host rocks, it is possible to deduce the interactions between rocks, veins and, consequently, fluids during the different stages. SUMMARY FOR THE GENERAL PUBLIC: Stable isotope geochemistry has gained considerable importance in recent years in helping to resolve many geological questions, based on the characteristics of isotopic fractionation in different systems. A thorough knowledge of the mechanisms governing isotopic fractionation between minerals and the fluids from which they form is therefore necessary. These mechanisms have been well characterized by various types of calibration for high-temperature systems, but this is not the case for systems at temperatures below 400-500°C. This thesis aims to help describe and understand the phenomena that can affect isotopic fractionation at low temperatures, and their implications, through the study of quartz crystals. The samples chosen were natural crystals formed at temperatures of 400°C or below, mostly from Alpine hydrothermal fissures whose formation conditions have already been determined. Studying Alpine crystals also makes it possible to place the results in the context of Alpine metamorphism during the Miocene (21-13 Ma). After examination of the structure and morphology of the crystals and their characterization by cathodoluminescence (CL), detailed chemical analyses were carried out on the trace elements that can enter the quartz crystal lattice as impurities (i.e. Li, Na, Al, P, K, Ca, Ge, Ti) and on the stable isotopes of oxygen.
Depending on the growth conditions, most of the crystals show zoning that can readily be related to the distribution of trace elements analysed by electron microprobe, ion probe (SIMS) and LA-ICPMS. High aluminium concentrations (several thousand parts per million atomic) were observed in the outermost zones of the crystals. Moreover, Al and Li concentrations are always correlated; the presence of hydrogen, deduced from FTIR analyses, follows the same trend. The different crystal faces show distinct Al, Li and H concentrations for the same growth zones, with, for example, the lowest concentrations in the zones of the prismatic faces. This implies the presence of sector zoning, which has previously been observed mainly in carbonates but never before described for quartz. Only the Alpine crystals that are homogeneous in CL and grew at higher temperature (400°C) show none of these characteristics. By analogy with sector zoning in Al, an element that substitutes for Si in the quartz crystal lattice, it is conceivable that sector zoning could also apply to oxygen isotopes. Previous studies had indeed put forward this hypothesis. Our results were obtained both from in situ analyses by SIMS and from CO2-laser-assisted extraction on carefully separated quartz fragments, and the two methods agree. Sector zoning is indeed present in the Alpine crystals, but mainly in the very aluminium-rich zones. However, it was determined that the presence of Al at these levels had only a minimal effect on oxygen isotope fractionation.
Significant differences were observed between the r and z faces, but also within a single zone, indicating that fractionation may have taken place in disequilibrium; this is also visible in the completely opposite values between faces for the last growth phase of certain crystals. Taken together, these results suggest that the presence of sector zoning may be linked to various parameters, such as growth rate or the surface structure of the crystal, which do not affect trace element incorporation and isotopic fractionation in the same way. The possibility of sector zoning is important to take into account in any interpretation of isotopic data. The oxygen isotope analyses performed by SIMS also made it possible to distinguish significant small-scale variations during growth. CO2-laser measurements on some host rocks made it possible to distinguish several stages in mineral growth and to deduce the role of the host rock and the type of fluid. Together with previous studies, it was thus possible to better constrain the formation of these crystals in the Alpine context and fluid circulation during Alpine metamorphism in the Miocene.