897 results for (hyper)text
Abstract:
This paper describes recent improvements to the Cambridge Arabic Large Vocabulary Continuous Speech Recognition (LVCSR) Speech-to-Text (STT) system. It is shown that word-boundary context markers provide a powerful method of enhancing graphemic systems with implicit phonetic information, improving their modelling capability. In addition, a robust technique for full covariance Gaussian modelling in the Minimum Phone Error (MPE) training framework is introduced. This reduces the full covariance training to a diagonal covariance training problem, thereby solving related robustness problems. The full system results show that the combined use of these and other techniques within a multi-branch combination framework reduces the Word Error Rate (WER) of the complete system by up to 5.9% relative. Copyright © 2011 ISCA.
Abstract:
The paper describes a new approach to artificial intelligence (AI) and its role in design. This approach argues that AI can be seen as 'text', or in other words as a medium for the communication of design knowledge and information between designers. This paper will apply these ideas to reinterpreting an existing knowledge-based system (KBS) design tool, that is, CADET - a product design evaluation tool. The paper will discuss the authorial issues, amongst others, involved in the development of AI and KBS design tools by adopting this new approach. Consequently, the designers' rights and responsibilities will be better understood as the knowledge medium, through its concern with authorship, returns control to users rather than attributing the system with agent status. © 1998 Elsevier Science Ltd. All rights reserved.
Abstract:
Creating a realistic talking head, which given an arbitrary text as input generates a realistic-looking face speaking the text, has been a long-standing research challenge. Talking heads which cannot express emotion have been made to look very realistic by using concatenative approaches [Wang et al. 2011]; however, allowing the head to express emotion creates a much more challenging problem, and model-based approaches have shown promise in this area. While 2D talking heads currently look more realistic than their 3D counterparts, they are limited both in the range of poses they can express and in the lighting conditions that they can be rendered under. Previous attempts to produce videorealistic 3D expressive talking heads [Cao et al. 2005] have produced encouraging results but have not yet achieved the level of realism of their 2D counterparts.
Abstract:
This paper presents a complete system for expressive visual text-to-speech (VTTS), which is capable of producing expressive output, in the form of a 'talking head', given an input text and a set of continuous expression weights. The face is modeled using an active appearance model (AAM), and several extensions are proposed which make it more applicable to the task of VTTS. The model allows for normalization with respect to both pose and blink state which significantly reduces artifacts in the resulting synthesized sequences. We demonstrate quantitative improvements in terms of reconstruction error over a million frames, as well as in large-scale user studies, comparing the output of different systems. © 2013 IEEE.
Abstract:
Electro-optic switching in short-pitch polymer stabilized chiral nematic liquid crystals was studied and the relative contributions of flexoelectric and dielectric coupling were investigated: polymer stabilization was found to effectively suppress unwanted textural transitions of the chiral nematic liquid crystal and thereby enhance the electro-optical performance (high optical contrast for visible light, a near ideal optical hysteresis, fast electro-optic response). Test cells were studied that possessed interdigitated electrodes to electrically address the liquid crystal. Based on simulations, a well-fitted phenomenological description of the electro-optic response was derived considering both flexoelectro-optic and Kerr-effect based electro-optic response. © 2014 AIP Publishing LLC.
Abstract:
This paper presents an overview of the Text-to-Speech synthesis system developed at the Institute for Language and Speech Processing (ILSP). It focuses on the key issues regarding the design of the system components. The system currently fully supports three languages (Greek, English, Bulgarian) and is designed to be as language- and speaker-independent as possible. Experimental results are also presented which show that the system produces high-quality synthetic speech in terms of naturalness and intelligibility. The system was recently ranked among the first three systems worldwide in terms of achieved quality for the English language at the international Blizzard Challenge 2013 workshop. © 2014 Springer International Publishing.
Abstract:
To clarify the possible influence of Microcystis blooms on the exchange of phosphorus (P) between sediment and lake water, an enclosure experiment was conducted in the hypereutrophic subtropical Lake Donghu during July-September 2000. Eight enclosures were used: six received sediment while two were sediment-free. In mid-August, Microcystis blooms developed in all the enclosures. There was a persistent coincidence between the occurrence of Microcystis blooms and the increase of both total P (TP) and soluble reactive P (SRP) concentrations in the water of the enclosures with sediments. In sediment-free enclosures, TP and SRP concentrations remained rather stable throughout the experiment, in spite of the appearance of Microcystis blooms. The results indicate that Microcystis blooms induced massive release of P from the sediment, perhaps mediated by high pH caused by intense algal photosynthesis, and/or depressed concentrations of nitrate nitrogen (NO3-N). © 2002 Elsevier Science Ltd. All rights reserved.
Abstract:
We studied the application of Biomimetic Pattern Recognition to speaker recognition. A speaker recognition neural network that uses the network matching degree as its criterion is proposed and applied to a text-dependent speaker recognition system. Experimental results show that good performance can be obtained even with few training samples. Furthermore, misrecognition caused by untrained speakers appearing during testing can be controlled effectively. In addition, because Biomimetic Pattern Recognition is based on the idea of "cognition", enrolling new speakers does not require retraining the existing system.