902 resultados para Voice interfaces


Relevância:

20.00% 20.00%

Publicador:

Resumo:

As the telecommunications industry evolves over the next decade to provide the products and services that people will desire, several key technologies will become commonplace. Two of these, automatic speech recognition and text-to-speech synthesis, will provide users with more freedom on when, where, and how they access information. While these technologies are currently in their infancy, their capabilities are rapidly increasing and their deployment in today's telephone network is expanding. The economic impact of just one application, the automation of operator services, is well over $100 million per year. Yet there still are many technical challenges that must be resolved before these technologies can be deployed ubiquitously in products and services throughout the worldwide telephone network. These challenges include: (i) High level of accuracy. The technology must be perceived by the user as highly accurate, robust, and reliable. (ii) Easy to use. Speech is only one of several possible input/output modalities for conveying information between a human and a machine, much like a computer terminal or Touch-Tone pad on a telephone. It is not the final product. Therefore, speech technologies must be hidden from the user. That is, the burden of using the technology must be on the technology itself. (iii) Quick prototyping and development of new products and services. The technology must support the creation of new products and services based on speech in an efficient and timely fashion. In this paper I present a vision of the voice-processing industry with a focus on the areas with the broadest base of user penetration: speech recognition, text-to-speech synthesis, natural language processing, and speaker recognition technologies. The current and future applications of these technologies in the telecommunications industry will be examined in terms of their strengths, limitations, and the degree to which user needs have been or have yet to be met. Although noteworthy gains have been made in areas with potentially small user bases and in the more mature speech-coding technologies, these subjects are outside the scope of this paper.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes a range of opportunities for military and government applications of human-machine communication by voice, based on visits and contacts with numerous user organizations in the United States. The applications include some that appear to be feasible by careful integration of current state-of-the-art technology and others that will require a varying mix of advances in speech technology and in integration of the technology into applications environments. Applications that are described include (1) speech recognition and synthesis for mobile command and control; (2) speech processing for a portable multifunction soldier's computer; (3) speech- and language-based technology for naval combat team tactical training; (4) speech technology for command and control on a carrier flight deck; (5) control of auxiliary systems, and alert and warning generation, in fighter aircraft and helicopters; and (6) voice check-in, report entry, and communication for law enforcement agents or special forces. A phased approach for transfer of the technology into applications is advocated, where integration of applications systems is pursued in parallel with advanced research to meet future needs.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes the state of the art in applications of voice-processing technologies. In the first part, technologies concerning the implementation of speech recognition and synthesis algorithms are described. Hardware technologies such as microprocessors and DSPs (digital signal processors) are discussed. Software development environment, which is a key technology in developing applications software, ranging from DSP software to support software also is described. In the second part, the state of the art of algorithms from the standpoint of applications is discussed. Several issues concerning evaluation of speech recognition/synthesis algorithms are covered, as well as issues concerning the robustness of algorithms in adverse conditions.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This talk, which was the keynote address of the NAS Colloquium on Human-Machine Communication by Voice, discusses the past, present, and future of human-machine communications, especially speech recognition and speech synthesis. Progress in these technologies is reviewed in the context of the general progress in computer and communications technologies.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In molecular biology, the expression of fusion proteins is a very useful and well-established technique for the identification and one-step purification of gene products. Even a short fused sequence of five or six histidines enables proteins to bind to an immobilized metal ion chelate complex. By synthesis of a class of chelator lipids, we have transferred this approach to the concept of self-assembly. The specific interaction and lateral organization of a fluorescent fusion molecule containing a C-terminal oligohistidine sequence was studied by film balance techniques in combination with epifluorescence microscopy. Due to the phase behavior of the various lipid mixtures used, the chelator lipids can be laterally structured, generating two-dimensional arrays of histidine-tagged biomolecules. Because of the large variety of fusion proteins already available, this concept represents a powerful technique for orientation and organization of proteins at lipid interfaces with applications in biosensing, biofunctionalization of nanostructured interfaces, two-dimensional crystallization, and studies of lipid-anchored proteins.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In artificial multiferroics hybrids consisting of ferromagnetic La_(0.7)Sr_(0.3)MnO_(3) (LSMO) and ferroelectric BaTiO_(3) epitaxial layers, net Ti moments are found from polarized resonant soft x-ray reflectivity and absorption. The Ti dichroic reflectivity follows the Mn signal during the magnetization reversal, indicating exchange coupling between the Ti and Mn ions. However, the Ti dichroic reflectivity shows stronger temperature dependence than the Mn dichroic signal. Besides a reduced ferromagnetic exchange coupling in the interfacial LSMO layer, this may also be attributed to a weak Ti-Mn exchange coupling that is insufficient to overcome the thermal energy at elevated temperatures.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The interface between Au(hkl) basal planes and the ionic liquid 1-Ethyl-2,3-dimethyl imidazolium bis(trifluoromethyl)sulfonil imide was investigated by using both cyclic voltammetry and laser-induced temperature jump. Cyclic voltammetry showed characteristic features, revealing surface sensitive processes at the interfaces Au(hkl)/[Emmim][Tf2N]. From laser-induced heating the potential of maximum entropy (pme) is determined. Pme is close to the potential of zero charge (pzc) and, therefore, the technique provides relevant interfacial information. The following order for the pme values has been found: Au(111) > Au(100) > Au(110). This order correlates well with work function data and values of pzc in aqueous solutions.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The goal of this work was to provide professional and amateur writers with a new way of enhancing their productivity and mental well-being, by helping them overcoming writers block and being able to achieve a state of optimal experience while writing. Our approach is based on bringing together different components to create what we call a creative moment. A creative moment is composed by an image, a text, a mood, a location and a color. The color presented in the creative moment varied according to the mood that was associated to the creative moment. With the creative moments we hoped that our users could have a way to easily trigger their creativity and have a kick start in their work. The prototyping of a web crowdsourcing platform, named CreativeWall, and a Microsoft Word Add-In, that was used on the user study performed, is described and their implementations are discussed. The user study reveals that our approach does have a positive influence in the productivity of the participants when compared with another existing approach. The study also revealed that our approach can ease the process of achieving a state of optimal experience by enhancing one of the dimensions presented on the Flow Theory. At the end we present what we consider would be some possible future developments for the concept created during the development of this work.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In voice and alignment typology, a categorical distinction is generally made between inverse systems on the one hand and symmetrical voice systems on the other. A major reason for distinguishing between these two types is the assumption that inverse systems are governed by a hierarchy involving grammatical, semantic, and ontological criteria, while symmetrical voice systems are based on discourse-pragmatic factors. However, the two types also have several important properties in common, in particular the fact that they have more than one nonderived transitive construction. Based on data from three native languages of South America, we show that the line between the two types is not always easy to draw, and that features of the inverse type can coexist with those of the symmetrical-voice type in the same language.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

English and German words with piano accompaniment.