951 resultados para Visual Speech Recognition, Multiple Views, Frontal View, Profile View


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Malayalam is one of the 22 scheduled languages in India with more than 130 million speakers. This paper presents a report on the development of a speaker independent, continuous transcription system for Malayalam. The system employs Hidden Markov Model (HMM) for acoustic modeling and Mel Frequency Cepstral Coefficient (MFCC) for feature extraction. It is trained with 21 male and female speakers in the age group ranging from 20 to 40 years. The system obtained a word recognition accuracy of 87.4% and a sentence recognition accuracy of 84%, when tested with a set of continuous speech data.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Graphical techniques for modeling the dependencies of randomvariables have been explored in a variety of different areas includingstatistics, statistical physics, artificial intelligence, speech recognition, image processing, and genetics.Formalisms for manipulating these models have been developedrelatively independently in these research communities. In this paper weexplore hidden Markov models (HMMs) and related structures within the general framework of probabilistic independencenetworks (PINs). The paper contains a self-contained review of the basic principles of PINs.It is shown that the well-known forward-backward (F-B) and Viterbialgorithms for HMMs are special cases of more general inference algorithms forarbitrary PINs. Furthermore, the existence of inference and estimationalgorithms for more general graphical models provides a set of analysistools for HMM practitioners who wish to explore a richer class of HMMstructures.Examples of relatively complex models to handle sensorfusion and coarticulationin speech recognitionare introduced and treated within the graphical model framework toillustrate the advantages of the general approach.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

List of references in Harvard format for the accessibility text tutorial created by Denis's Angels.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The first part of this work presents an accurate analysis of the most relevant 3D registration techniques, including initial pose estimation, pairwise registration and multiview registration strategies. A new classification has been proposed, based on both the applications and the approach of the methods that have been discussed. The main contribution of this thesis is the proposal of a new 3D multiview registration strategy. The proposed approach detects revisited regions obtaining cycles of views that are used to reduce the inaccuracies that may exist in the final model due to error propagation. The method takes advantage of both global and local information of the registration process, using graph theory techniques in order correlate multiple views and minimize the propagated error by registering the views in an optimal way. The proposed method has been tested using both synthetic and real data, in order to show and study its behavior and demonstrate its reliability.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper reviews a study to determine the relation between the aided articulation index and the aided speech recognition scores obtained with the Monosyllable, Trochee and Spondee (MTS) Test, when administered to hearing-impaired children.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The equivalency of 34 TIMIT sentence lists was evaluated using adult cochlear implant recipients to determine if they should be recommended for future clinical or research use. Because these sentences incorporate gender, dialect and speaking rate variations, they have the potential to better represent speech recognition abilities in real-world communication situations.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Inconsistencies exist between traditional objective measures such as speech recognition and localization, and subjective reports of bimodal benefit. The purpose of this study was to expand the set of objective measures of bimodal benefit to include non-traditional listening tests, and to examine possible correlations between objective measures of auditory perception and subjective satisfaction reports.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Tactile discrimination performance depends on the receptive field (RF) size of somatosensory cortical (SI) neurons. Psychophysical masking effects can reveal the RF of an idealized "virtual" somatosensory neuron. Previous studies show that top-down factors strongly affect tactile discrimination performance. Here, we show that non-informative vision of the touched body part influences tactile discrimination by modulating tactile RFs. Ten subjects performed spatial discrimination between touch locations on the forearm. Performance was improved when subjects saw their forearm compared to viewing a neutral object in the same location. The extent of visual information was relevant, since restricted view of the forearm did not have this enhancing effect. Vibrotactile maskers were placed symmetrically on either side of the tactile target locations, at two different distances. Overall, masking significantly impaired discrimination performance, but the spatial gradient of masking depended on what subjects viewed. Viewing the body reduced the effect of distant maskers, but enhanced the effect of close maskers, as compared to viewing a neutral object. We propose that viewing the body improves functional touch by sharpening tactile RFs in an early somatosensory map. Top-down modulation of lateral inhibition could underlie these effects.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper argues that transatlantic hybridity connects space, visual style and ideological point of view in British television action-adventure fiction of the 1960s–1970s. It analyses the relationship between the physical location of TV series production at Elstree Studios, UK, the representation of place in programmes, and the international trade in television fiction between the UK and USA. The TV series made at Elstree by the ITC and ABC companies and their affiliates linked Britishness with an international modernity associated with the USA, while also promoting national specificity. To do this, they drew on film production techniques that were already common for TV series production in Hollywood. The British series made at Elstree adapted versions of US industrial organization and television formats, and made programmes expected to be saleable to US networks, on the basis of British experiences in TV co-production with US companies and of the international cinema and TV market.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This Capstone Project attempts to determine the ability of normal hearing children to resolve spectral information, and the relationship between spectral resolution ability and speech recognition ability in noise. This study also examines how these abilities develop with age.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The present study is part of an ongoing investigation into the characteristics of Myxozoan parasites of Brazilian freshwater fish and was carried out using morphology, histopathology and electron microscopy analysis. A new Myxosporea species (Henneguya pseudoplatystoma) is described causing an important reduction in gill function in the farmed pintado (a hybrid fish from a cross between Pseudoplatystoma corruscans and Pseudoplatystoma fasciatum), which is a commercially important South American catfish. From a total of 98 pintado juveniles from fish farms in the states of Sao Paulo and Mato Grosso do Sul (Brazil), 36 samples (36.7%) exhibited infection of the gill filaments. infection was intense, with several plasmodia occurring on a same gill filament. The plasmodia were white and measured up to 0.5 mm in length; mature spores were ellipsoidal in the frontal view, measuring 33.2 +/- 1.9 mu m in total length, 10.4 +/- 0.6 mu m in body length, 3.4 +/- 0.4 mu m in width and 22.7 +/- 1.7 mu m in the caudal process. The polar capsules were elongated, measuring 3.3 +/- 0.4 mu m in length and 1.0 +/- 0.1 mu m in width and the polar filaments had six to seven turns. Histopathological analysis revealed the parasite in the connective tissue of the gill filaments and lamella. No inflammatory process was observed, but the development of the plasmodia reduced the area of functional epithelium. Ultrastructural analyses revealed a single plasmodial wall, which was in direct contact with the host cells and had numerous projections in direction of the host cells as well as extensive pinocytotic canals. A thick layer (2-6 mu m) of fibrous material and numerous mitochondria were found in the ectoplasm. Generative cells and the earliest stage of sporogenesis were seen more internally. Advanced spore developmental stages and mature spores were found in the central portion of the plasmodia. (C) 2009 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A new myxosporean species, Henneguya eirasi n. sp., is described parasitizing the gill filaments of Pseudoplatystoma corruscans and Pseudoplatystoma fasciatum (Siluriformes: Pimelodidae) caught in the Patanal Wetland of the state of Mato Grosso, Brazil. The parasite formed white, elongated plasmodia measuring up to 3 mm. Mature spores were ellipsoidal in the frontal view, measuring 37.1 +/- 1.8 mu m in total length, 12.9 +/- 0.8 mu m in body length, 3.4 +/- 0.3 mu m in width, 3.1 +/- 0.1 mu m in thickness and 24.6 +/- 2.2 mu m in the caudal process. Polar capsules were elongated and equal in size, measuring 5.4 +/- 0.5 mu m in length and 0.7 +/- 0.1 mu m in width. Polar filaments had 12-13 coils. Histopathological analysis revealed that the parasite developed in the sub-epithelial connective tissue of the gill filaments and the plasmodia were surrounded by a capsule of host connective tissue. The plasmodia caused slight compression of the adjacent tissues, but no inflammatory response was observed in the infection site. Ultrastructure analysis revealed a single plasmodial wall connected to the ectoplasmic zone through numerous pinocytotic canals. The plasmodial wall exhibited numerous projections and slightly electron-dense material was found in the ectoplasm next to the plasmodial wall, forming a line just below the wall. Partial sequencing of the 18S rDNA gene of H. eirasi n. sp. obtained from P. fasciatum resulted in a total of 1066 bp and this sequence did not match any of the Myxozoa available in the GenBank. Phylogenetic analysis revealed the Henneguya species clustering into clades following the order and family of the host fishes. H. eirasi n. sp. clustered alone in one clade, which was the basal unit for the clade composed of Henneguya species parasites of siluriform ictalurids. The prevalence of the parasite was 17.1% in both fish species examined. Parasite prevalence was not influenced by season, host sex or host size. (C) 2011 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Dynamic Time Warping (DTW), a pattern matching technique traditionally used for restricted vocabulary speech recognition, is based on a temporal alignment of the input signal with the template models. The principal drawback of DTW is its high computational cost as the lengths of the signals increase. This paper shows extended results over our previously published conference paper, which introduces an optimized version of the DTW I hat is based on the Discrete Wavelet Transform (DWT). (C) 2008 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A strong brand can help a company to be easier to recognize and be remembered by their customers. Part of branding is to create a visual appearance which is called a graphic profile and contains information on for instance a logo, colours or typography.The objective of this thesis was to create a graphic profile for Stjørdal Tannhelsesenter that could serve as a base for a brand work in the future. It was important to analyze how the clinic wants to be perceived and how it is perceived today. The methods used to carry out this study were questionnaires, researches and a focus group.The work resulted in the generation of a graphic profile that included a new logo, colours, decorative elements, fonts, templates for stationery, business card / badge, imagery, and examples of some publications.According to the test which was made to examine the results of the project showed that I was successful and that the new design meets all the requests from the previous questionnaires.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Allt eftersom utvecklingen går framåt inom applikationer och system så förändras också sättet på vilket vi interagerar med systemet på. Hittills har navigering och användning av applikationer och system mestadels skett med händerna och då genom mus och tangentbord. På senare tid så har navigering via touch-skärmar och rösten blivit allt mer vanligt. Då man ska styra en applikation med hjälp av rösten är det viktigt att vem som helst kan styra applikationen, oavsett vilken dialekt man har. För att kunna se hur korrekt ett röstigenkännings-API (Application Programming Interface) uppfattar svenska dialekter så initierades denna studie med dokumentstudier om dialekters kännetecken och ljudkombinationer. Dessa kännetecken och ljudkombinationer låg till grund för de ord vi valt ut till att testa API:et med. Varje dialekt fick alltså ett ord uppbyggt för att vara extra svårt för API:et att uppfatta när det uttalades av just den aktuella dialekten. Därefter utvecklades en prototyp, närmare bestämt en android-applikation som fungerade som ett verktyg i datainsamlingen. Då arbetet innehåller en prototyp och en undersökning så valdes Design and Creation Research som forskningsstrategi med datainsamlingsmetoderna dokumentstudier och observationer för att få önskat resultat. Data samlades in via observationer med prototypen som hjälpmedel och med hjälp av dokumentstudier. Det empiriska data som registrerats via observationerna och med hjälp av applikationen påvisade att vissa dialekter var lättare för API:et att uppfatta korrekt. I vissa fall var resultaten väntade då vissa ord uppbyggda av ljudkombinationer i enlighet med teorin skulle uttalas väldigt speciellt av en viss dialekt. Ibland blev det väldigt låga resultat på just dessa ord men i andra fall förvånansvärt höga. Slutsatsen vi drog av detta var att de ord vi valt ut med en baktanke om att de skulle få låga resultat för den speciella dialekten endast visade sig stämma vid två tillfällen. Det var istället det ord innehållande sje- och tje-ljud som enligt teorin var gemensamma kännetecken för alla dialekter som fick lägst resultat överlag.