Biblioteca Digital

17 resultados para Chinese characters.

Exploiting Chinese character models to improve speech recognition performance

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Chinese language is based on characters which are syllabic in nature. Since languages have syllabotactic rules which govern the construction of syllables and their allowed sequences, Chinese character sequence models can be used as a first level approximation of allowed syllable sequences. N-gram character sequence models were trained on 4.3 billion characters. Characters are used as a first level recognition unit with multiple pronunciations per character. For comparison the CU-HTK Mandarin word based system was used to recognize words which were then converted to character sequences. The character only system error rates for one best recognition were slightly worse than word based character recognition. However combining the two systems using log-linear combination gives better results than either system separately. An equally weighted combination gave consistent CER gains of 0.1-0.2% absolute over the word based standard system. Copyright © 2009 ISCA.

Exploiting Chinese character models to improve speech recognition performance

Relevância:

20.00% 20.00%

Publicador:

Contract manufacturing service providers in Chinese manufacturing - emergence, catching-up, and over-taking

Relevância:

20.00% 20.00%

Publicador:

Manufacturing globalisation and development of Chinese manufacturing capability

Relevância:

20.00% 20.00%

Publicador:

New generation of vertical integration: new approach of chinese contractual manufacturers

Relevância:

20.00% 20.00%

Publicador:

A preliminary framework to match OEM's requirements and Chinese EMS capability

Relevância:

20.00% 20.00%

Publicador:

The John-Hopkins University 2003 Chinese-English machine translation system

Relevância:

20.00% 20.00%

Publicador:

Strategy of Multinational Corporations for Chinese Enterprises

Relevância:

20.00% 20.00%

Publicador:

A dialectal Chinese speech recognition framework

Relevância:

20.00% 20.00%

Publicador:

Research on the box counting algorithm in fractal dimension measurement (In Chinese)

Relevância:

20.00% 20.00%

Publicador:

Developing Chinese collaborative manufacturing: from the perspective of the Western manufacturing system evolution

Relevância:

20.00% 20.00%

Publicador:

Current human resource practices in Chinese entrepreneurial firms and their development over the firms' Growth Trajectory'

Relevância:

20.00% 20.00%

Publicador:

3C Framework for Modular Supply Networks in the Chinese Automotive Industry

Relevância:

20.00% 20.00%

Publicador:

Shanzhai Manufacturing – an alternative innovation phenomenon in China: its value chain and implications for Chinese science and technology policies

Relevância:

20.00% 20.00%

Publicador:

Syllable language models for Mandarin speech recognition: exploiting character language models.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Mandarin Chinese is based on characters which are syllabic in nature and morphological in meaning. All spoken languages have syllabiotactic rules which govern the construction of syllables and their allowed sequences. These constraints are not as restrictive as those learned from word sequences, but they can provide additional useful linguistic information. Hence, it is possible to improve speech recognition performance by appropriately combining these two types of constraints. For the Chinese language considered in this paper, character level language models (LMs) can be used as a first level approximation to allowed syllable sequences. To test this idea, word and character level n-gram LMs were trained on 2.8 billion words (equivalent to 4.3 billion characters) of texts from a wide collection of text sources. Both hypothesis and model based combination techniques were investigated to combine word and character level LMs. Significant character error rate reductions up to 7.3% relative were obtained on a state-of-the-art Mandarin Chinese broadcast audio recognition task using an adapted history dependent multi-level LM that performs a log-linearly combination of character and word level LMs. This supports the hypothesis that character or syllable sequence models are useful for improving Mandarin speech recognition performance.

«
1
2
»