24 results for Speech Recognition Systems

in BORIS: Bern Open Repository and Information System - Bern - Switzerland


Relevance:

90.00%

Publisher:

Abstract:

Smart homes for the aging population have recently started attracting the attention of the research community. The "health state" of smart homes comprises many different levels; starting with the physical health of citizens, it also includes longer-term health norms and outcomes, as well as the arena of positive behavior changes. One of the problems of interest is to monitor the activities of daily living (ADL) of the elderly, aiming at their protection and well-being. For this purpose, we installed passive infrared (PIR) sensors to detect motion in a specific area inside a smart apartment and used them to collect a set of ADL. In a novel approach, we describe a technology that allows the ground truth collected in one smart home to train activity recognition systems for other smart homes. We asked the users to label all instances of all ADL only once and subsequently applied data mining techniques to cluster in-home sensor firings. Each cluster would therefore represent the instances of the same activity. Once the clusters were associated with their corresponding activities, our system was able to recognize future activities. To improve the activity recognition accuracy, our system preprocessed raw sensor data by identifying overlapping activities. To evaluate the recognition performance on a 200-day dataset, we implemented three different active learning classification algorithms and compared their performance: naive Bayesian (NB), support vector machine (SVM) and random forest (RF). Based on our results, the RF classifier recognized activities with an average specificity of 96.53%, a sensitivity of 68.49%, a precision of 74.41% and an F-measure of 71.33%, outperforming both the NB and SVM classifiers. Further clustering markedly improved the results of the RF classifier. An activity recognition system based on PIR sensors in conjunction with a clustering classification approach was able to detect ADL from datasets collected from different homes. Thus, our PIR-based smart home technology could improve care and provide valuable information to better understand the functioning of our societies, as well as to inform both individual and collective action in a smart city scenario.
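
The reported F-measure can be checked against the reported precision and sensitivity. The following is a minimal sketch, assuming the abstract's F-measure is the balanced F1 score (the harmonic mean of precision and recall, with sensitivity taken as recall); the function and variable names are illustrative and not taken from the paper.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure (F1): harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported for the RF classifier in the abstract, as fractions.
rf_precision = 0.7441
rf_sensitivity = 0.6849  # sensitivity is the same quantity as recall

rf_f = f_measure(rf_precision, rf_sensitivity)
print(f"F-measure: {rf_f:.2%}")  # prints "F-measure: 71.33%"
```

Under that assumption, the computed value agrees with the 71.33% stated in the abstract.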

Relevance:

90.00%

Publisher:

Abstract:

OBJECTIVE: To evaluate speech intelligibility in noise with a new cochlear implant (CI) processor that uses a pinna-effect-imitating directional microphone system. STUDY DESIGN: Prospective experimental study. SETTING: Tertiary referral center. PATIENTS: Ten experienced, unilateral CI recipients with bilateral severe-to-profound hearing loss. INTERVENTION: All participants performed speech-in-noise tests with the Opus 2 processor (omnidirectional microphone mode only) and the newer Sonnet processor (omnidirectional and directional microphone modes). MAIN OUTCOME MEASURE: The speech reception threshold (SRT) in noise was measured in four spatial settings. The test sentences were always presented from the front. The noise arrived from the front (S0N0), the ipsilateral side of the CI (S0NIL), the contralateral side of the CI (S0NCL), or the back (S0N180). RESULTS: The directional mode improved the SRTs by 3.6 dB (p < 0.01), 2.2 dB (p < 0.01), and 1.3 dB (p < 0.05) in the S0N180, S0NIL, and S0NCL situations, respectively, when compared with the Sonnet in the omnidirectional mode. There was no statistically significant difference in the S0N0 situation. No differences between the Opus 2 and the Sonnet in the omnidirectional mode were observed. CONCLUSION: Speech intelligibility with the Sonnet system was statistically different from speech recognition with the Opus 2 system, suggesting that CI users might profit from the pinna-effect-imitating directionality mode in noisy environments.

Relevance:

80.00%

Publisher:

Abstract:

The level of improvement in the audiological results of Baha® users mainly depends on the patient's preoperative hearing thresholds and the type of Baha sound processor used. This investigation shows correlations between the preoperative hearing threshold and postoperative aided thresholds and audiological results in speech understanding in quiet of 84 Baha users with unilateral conductive hearing loss, bilateral conductive hearing loss and bilateral mixed hearing loss. Secondly, speech understanding in noise of 26 Baha users with different Baha sound processors (Compact, Divino, and BP100) is investigated. Linear regression between aided sound field thresholds and bone conduction (BC) thresholds of the better ear shows the highest correlation coefficients and the steepest slope. Differences between better BC thresholds and aided sound field thresholds are smallest for mid-frequencies (1 and 2 kHz) and become larger at 0.5 and 4 kHz. For Baha users, the gain in speech recognition in quiet can be expected to lie in the order of magnitude of the gain in their hearing threshold. Compared to its predecessor sound processors Baha® Compact and Baha® Divino, Baha® BP100 improves speech understanding in noise significantly by +0.9 to +4.6 dB signal-to-noise ratio, depending on the setting and the use of the directional microphone. For Baha users with unilateral and bilateral conductive hearing loss and bilateral mixed hearing loss, audiological results in aided sound field thresholds can be estimated with the better BC hearing threshold. The benefit in speech understanding in quiet can be expected to be similar to the gain in their sound field hearing threshold. The most recent technology of Baha sound processor improves speech understanding in noise by an order of magnitude that is well perceived by users and which can be very useful in everyday life.

Relevance:

80.00%

Publisher:

Abstract:

Individual recognition systems require the sender to be individually distinctive and the receiver to be able to perceive differences between individuals and react accordingly. Many studies have demonstrated that acoustic signals of almost any species contain individualized information. However, fewer studies have tested experimentally whether those signals are used for individual recognition by potential receivers. While laboratory studies using zebra finches have shown that fledglings recognize their parents by their "distance call", mutual recognition using the same call type has not yet been demonstrated. In a laboratory study with zebra finches, we first quantified between-individual acoustic variation in distance calls of fledglings. In a second step, we tested recognition of fledgling calls by parents using playback experiments. With a discriminant function analysis, we show that individuals are highly distinctive and that most measured parameters show very high potential to encode individuality. The response pattern of zebra finch parents shows that they do react to calls of fledglings; however, they do not distinguish between their own and unfamiliar offspring, despite individual distinctiveness. This finding is interesting in light of the observation of a high percentage of misdirected feedings in our communal breeding aviaries. Our results demonstrate the importance of adopting a receiver's perspective and suggest that variation in fledgling contact calls might not be used in individual recognition of offspring.

Relevance:

80.00%

Publisher:

Abstract:

Crowdsourcing linguistic phenomena with smartphone applications is relatively new. Apps have been used to train acoustic models for automatic speech recognition (de Vries et al. 2014) and to archive endangered languages (Iwaidja Inyaman Team 2012). Leemann and Kolly (2013) developed a free app for iOS, Dialäkt Äpp (DÄ) (>78k downloads), to document language change in Swiss German. Here, we present results of sound change based on DÄ data. DÄ predicts the users' dialects: for 16 variables, users select their dialectal variant. DÄ then tells users which dialect they speak. Underlying this prediction are maps from the Linguistic Atlas of German-speaking Switzerland (SDS, 1962-2003), which documents the linguistic situation around 1950. If predicted wrongly, users indicate their actual dialect. With this information, the 16 variables can be assessed for language change. Results revealed robustness of phonetic variables; lexical and morphological variables were more prone to change. Phonetic variables like 'to lift' (variants: /lupfə, lʏpfə, lipfə/) revealed SDS agreement scores of nearly 85%, i.e., little sound change. Not all phonetic variables are equally robust: 'ladle' (variants: /xælə, xællə, xæuə, xæɫə, xæɫɫə/) exhibited significant sound change. We will illustrate the results using maps that show details of the sound changes at hand.

Relevance:

30.00%

Publisher:

Abstract:

There is increasing recognition that transdisciplinary approaches are needed to create suitable knowledge for sustainable water management. However, there is no common understanding of what transdisciplinary research may be and there is very limited debate on potentials and challenges regarding its implementation. Against this background, this paper presents a conceptual framework for transdisciplinary co-production of knowledge in water management projects oriented towards more sustainable use of water. Moreover, first experiences with its implementation are discussed. In so doing, the focus lies on potentials and challenges related to the co-production of systems, target and transformation knowledge by researchers and local stakeholders.

Relevance:

30.00%

Publisher:

Abstract:

This paper proposes a sequential coupling of a Hidden Markov Model (HMM) recognizer for offline handwritten English sentences with a probabilistic bottom-up chart parser using Stochastic Context-Free Grammars (SCFG) extracted from a text corpus. Based on extensive experiments, we conclude that syntax analysis helps to improve recognition rates significantly.

Relevance:

30.00%

Publisher:

Abstract:

Background: Although the frequency of associated malformations is high, the incidence of inheritable syndromes is widely underestimated in children with anorectal malformation (ARM). Data sources: OMIM database, patient records and charts of the Department of Pediatric Surgery, Johannes Gutenberg-University, Mainz, Germany. Methods: We analyzed all associations, sequences and syndromes listed in the OMIM database that can be accompanied by ARM. A large cohort of children born with ARM was then retrospectively investigated as to the type of ARM, presence of additional malformations and possible categorization as a syndrome, sequence or association. For this process a syndrome finder was developed and employed. This simplistic tool allows for a rapid first check of possible syndromes before a more complex analysis is started using the OMIM database and consulting specialists. Results: Among 317 children with ARM, associated malformations were present in 77.7% of 127 children with high ARM, in 68.7% of 32 with intermediate ARM, and in 25.3% of 158 with low-type ARM. Three or more organ systems were involved in 29.1% of children with high-type ARM, 25% with intermediate ARM and 8.2% with low-type ARM. An association of the vertebral anal tracheo-esophageal renal (VATER) and vertebral anal cardiac tracheo-esophageal renal limb (VACTERL) type was found in a total of 35 patients. Before analysis, 11 syndromes and 35 associations which were not clear previously in this patient cohort were described. In 17 other patients, 14 syndromes and 3 associations were identified. Conclusions: The high number of only retrospectively identified syndromes suggests that a routine search is necessary in every patient with ARM and additional malformations.