989 resultados para minority script processing


Relevância:

100.00% 100.00%

Publicador:

Resumo:

一次开发多语言使用是国际化软件开发的主要目标。但是世界上的文字多种多样,它们的书写方向也有所不同,除了水平从左向右书写的英文、水平从右往左书写的阿拉伯文外,还有类似蒙古文这样垂直排列的文字,这对计算机图形用户界面提出了更高的要求,现有的计算机系统将这类垂直排列的文字沿水平方向输出,极不符合少数民族人民的习惯。在分析现有Qt库对类似阿拉伯文这样从右向左书写的文字的部分支持机制的基础上,我们设计并实现了支持四种方向模式的国际化的图形用户界面,现在它已经能够适应世界上几乎所有的文字。这对于软件国际化以及民族语言信息处理有重要意义。

Relevância:

80.00% 80.00%

Publicador:

Resumo:

基于ISO/ IEC 10646和UNICODE国际标准,用传统的字体技术(如TrueType)来实现少数民族文字处理所面临的一个"瓶颈"问题是:"变形显现字符"不存在确定的码位.这也是多年来民文系统重复开发、互不兼容的根本原因.本文基于ICU的文字处理体系结构,阐述了完全支持Unicode标准的少数民族文字(本文主要指蒙古文字、维文、藏文等)的实现方法.文中首先介绍了少数民族文字的特点,分析其与拉丁文字、汉字在计算机输入、输出过程中的不同之处,并指出少数民族文字处理的难点.其次介绍了一种能满足少数民族文字处理需求的字体技术--OpenType.最后,阐述了文字处理引擎的工作原理,以及ICU中如何实现对少数民族文字的支持.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

复杂文字在显示输出的过程中,表现出极为复杂的语言特征.为此提出了一种基于谓词规则的复杂文字处理模型,模型以谓词规则的方法给出了复杂文字字形布局特征的形式化描述,按照复杂文字处理的过程,设计了实现该模型的软件体系结构,将复杂文字的语言特征从程序控制逻辑中隔离出来,提高了系统的灵活性,便于增加新的复杂文字的支持.在研制蒙古文、藏文、维吾尔文办公套件的应用中表明,该模型是实用有效的.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

文档处理是文字处理的关键组成部分,针对多语言混合排版的需求,本文提出了基于“框”的支持不同方向的多语言文本布局的文档处理模型。该模型把时文本布局方向的处理封装在文档格式化模块中,将多文本布局方向的问题规约为文本布局方向为从左向右(水平)的文档格式化的问题,并设计了多文本布局方向文档格式化的递归算法。该模型可以很好支持包括我国民族文字蒙古文、维吾尔文、藏文在内的各种不同书写方向文字的文本布局。

Relevância:

40.00% 40.00%

Publicador:

Resumo:

To date, studies have focused on the acquisition of alphabetic second languages (L2s) in alphabetic first language (L1) users, demonstrating significant transfer effects. The present study examined the process from a reverse perspective, comparing logographic (Mandarin-Chinese) and alphabetic (English) L1 users in the acquisition of an artificial logographic script, in order to determine whether similar language-specific advantageous transfer effects occurred. English monolinguals, English-French bilinguals and Chinese-English bilinguals learned a small set of symbols in an artificial logographic script and were subsequently tested on their ability to process this script in regard to three main perspectives: L2 reading, L2 working memory (WM), and inner processing strategies. In terms of L2 reading, a lexical decision task on the artificial symbols revealed markedly faster response times in the Chinese-English bilinguals, indicating a logographic transfer effect suggestive of a visual processing advantage. A syntactic decision task evaluated the degree to which the new language was mastered beyond the single word level. No L1-specific transfer effects were found for artificial language strings. In order to investigate visual processing of the artificial logographs further, a series of WM experiments were conducted. Artificial logographs were recalled under concurrent auditory and visuo-spatial suppression conditions to disrupt phonological and visual processing, respectively. No L1-specific transfer effects were found, indicating no visual processing advantage of the Chinese-English bilinguals. However, a bilingual processing advantage was found indicative of a superior ability to control executive functions. In terms of L1 WM, the Chinese-English bilinguals outperformed the alphabetic L1 users when processing L1 words, indicating a language experience-specific advantage. Questionnaire data on the cognitive strategies that were deployed during the acquisition and processing of the artificial logographic script revealed that the Chinese-English bilinguals rated their inner speech as lower than the alphabetic L1 users, suggesting that they were transferring their phonological processing skill set to the acquisition and use of an artificial script. Overall, evidence was found to indicate that language learners transfer specific L1 orthographic processing skills to L2 logographic processing. Additionally, evidence was also found indicating that a bilingual history enhances cognitive performance in L2.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Studies of orthographic skills transfer between languages focus mostly on working memory (WM) ability in alphabetic first language (L1) speakers when learning another, often alphabetically congruent, language. We report two studies that, instead, explored the transferability of L1 orthographic processing skills in WM in logographic-L1 and alphabetic-L1 speakers. English-French bilingual and English monolingual (alphabetic-L1) speakers, and Chinese-English (logographic-L1) speakers, learned a set of artificial logographs and associated meanings (Study 1). The logographs were used in WM tasks with and without concurrent articulatory or visuo-spatial suppression. The logographic-L1 bilinguals were markedly less affected by articulatory suppression than alphabetic-L1 monolinguals (who did not differ from their bilingual peers). Bilinguals overall were less affected by spatial interference, reflecting superior phonological processing skills or, conceivably, greater executive control. A comparison of span sizes for meaningful and meaningless logographs (Study 2) replicated these findings. However, the logographic-L1 bilinguals’ spans in L1 were measurably greater than those of their alphabetic-L1 (bilingual and monolingual) peers; a finding unaccounted for by faster articulation rates or differences in general intelligence. The overall pattern of results suggests an advantage (possibly perceptual) for logographic-L1 speakers, over and above the bilingual advantage also seen elsewhere in third language (L3) acquisition.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Two experiments investigated the extent of message processing of a persuasive communication proposed by either a numerical majority or minority. Both experiments crossed source status (majority versus minority) with message quality (strong versus weak arguments) to determine which source condition is associated with systematic processing. The first experiment showed a reliable difference between strong and weak messages, indicating systematic processing had occurred, for a minority irrespective of message direction (pro- versus counter-attitudinal), but not for a majority. The second experiment showed that message outcome moderates when a majority or a minority leads to systematic processing. When the message argued for a negative personal outcome, there was systematic processing only for the majority source; but when the message did not argue for a negative personal outcome, there was systematic processing only for the minority source. Thus one key moderator of whether a majority or minority source leads to message processing is whether the topic induces defensive processing motivated by self-interest. Copyright (C) 2002 John Wiley Sons, Ltd.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This chapter examines the contexts in which people will process more deeply, and therefore be more influenced by, a position that is supported by either a numerical majority or minority. The chapter reviews the major theories of majority and minority influence with reference to which source condition is associated with most message processing (and where relevant, the contexts under which this occurs) and experimental research examining these predictions. The chapter then presents a new theoretical model (the source-context-elaboration model, SCEM) that aims to integrate the disparate research findings. The model specifies the processes underlying majority and minority influence, the contexts under which these processes occur and the consequences for attitudes changed by majority and minority influence. The chapter then describes a series of experiments that address each of the aspects of the theoretical model. Finally, a range of research-related issues are discussed and future issues for the research area as a whole are considered.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Two experiments examined the extent to which attitudes changed following majority and minority influence are resistant to counter-persuasion. In both experiments participants' attitudes were measured after being exposed to two messages, delayed in time, which argued opposite positions (initial message and counter-message). In the first experiment, attitudes following minority endorsement of the initial message were more resistant to a second counter-message only when the initial message contained strong versus weak arguments. Attitudes changed following majority influence did not resist the second counter-message and returned to their pre-test level. Experiment 2 varied whether memory was warned (i.e., message recipients expected to recall the message) or not, to manipulate message processing. When memory was warned, which should increase message processing, attitudes changed following both majority and minority influence resisted the second counter-message. The results support the view that minority influence instigates systematic processing of its arguments, leading to attitudes that resist counter-persuasion. Attitudes formed following majority influence yield to counter-persuasion unless there is a secondary task that encourages message processing.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Two experiments investigated the extent of message processing of a persuasive communication proposed by either a numerical majority or minority. Both experiments crossed source status (majority versus minority) with message quality (strong versus weak arguments) to determine which source condition is associated with systematic processing. The first experiment showed a reliable difference between strong and weak messages, indicating systematic processing had occurred, for a minority irrespective of message direction (pro- versus counter-attitudinal), but not for a majority. The second experiment showed that message outcome moderates when a majority or a minority leads to systematic processing. When the message argued for a negative personal outcome, there was systematic processing only for the majority source; but when the message did not argue for a negative personal outcome, there was systematic processing only for the minority source. Thus one key moderator of whether a majority or minority source leads to message processing is whether the topic induces defensive processing motivated by self-interest.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Two experiments investigated the conditions under which majority and minority sources instigate systematic processing of their messages. Both experiments crossed source status (majority vs. minority) with message quality (strong vs. weak arguments). In each experiment, message elaboration was manipulated by varying either motivational (outcome relevance, Experiment 1) or cognitive (orientating tasks, Experiment 2) factors. The results showed that when either motivational or cognitive factors encouraged low message elaboration, there was heuristic acceptance of the majority position without detailed message processing. When the level of message elaboration was intermediate, there was message processing only for the minority source. Finally, when message elaboration was high, there was message processing for both source conditions. These results show that majority and minority influence is sensitive to motivational and cognitive factors that constrain or enhance message elaboration and that both sources can lead to systematic processing under specific circumstances. © 2007 by the Society for Personality and Social Psychology, Inc.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The problem of determining the script and language of a document image has a number of important applications in the field of document analysis, such as indexing and sorting of large collections of such images, or as a precursor to optical character recognition (OCR). In this paper, we investigate the use of texture as a tool for determining the script of a document image, based on the observation that text has a distinct visual texture. An experimental evaluation of a number of commonly used texture features is conducted on a newly created script database, providing a qualitative measure of which features are most appropriate for this task. Strategies for improving classification results in situations with limited training data and multiple font types are also proposed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper addresses the problem of resolving ambiguities in frequently confused online Tamil character pairs by employing script specific algorithms as a post classification step. Robust structural cues and temporal information of the preprocessed character are extensively utilized in the design of these algorithms. The methods are quite robust in automatically extracting the discriminative sub-strokes of confused characters for further analysis. Experimental validation on the IWFHR Database indicates error rates of less than 3 % for the confused characters. Thus, these post processing steps have a good potential to improve the performance of online Tamil handwritten character recognition.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper describes a semi-automatic tool for annotation of multi-script text from natural scene images. To our knowledge, this is the maiden tool that deals with multi-script text or arbitrary orientation. The procedure involves manual seed selection followed by a region growing process to segment each word present in the image. The threshold for region growing can be varied by the user so as to ensure pixel-accurate character segmentation. The text present in the image is tagged word-by-word. A virtual keyboard interface has also been designed for entering the ground truth in ten Indic scripts, besides English. The keyboard interface can easily be generated for any script, thereby expanding the scope of the toolkit. Optionally, each segmented word can further be labeled into its constituent characters/symbols. Polygonal masks are used to split or merge the segmented words into valid characters/symbols. The ground truth is represented by a pixel-level segmented image and a '.txt' file that contains information about the number of words in the image, word bounding boxes, script and ground truth Unicode. The toolkit, developed using MATLAB, can be used to generate ground truth and annotation for any generic document image. Thus, it is useful for researchers in the document image processing community for evaluating the performance of document analysis and recognition techniques. The multi-script annotation toolokit (MAST) is available for free download.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this work, we describe a system, which recognises open vocabulary, isolated, online handwritten Tamil words and extend it to recognize a paragraph of writing. We explain in detail each step involved in the process: segmentation, preprocessing, feature extraction, classification and bigram-based post-processing. On our database of 45,000 handwritten words obtained through tablet PC, we have obtained symbol level accuracy of 78.5% and 85.3% without and with the usage of post-processing using symbol level language models, respectively. Word level accuracies for the same are 40.1% and 59.6%. A line and word level segmentation strategy is proposed, which gives promising results of 100% line segmentation and 98.1% word segmentation accuracies on our initial trials of 40 handwritten paragraphs. The two modules have been combined to obtain a full-fledged page recognition system for online handwritten Tamil data. To the knowledge of the authors, this is the first ever attempt on recognition of open vocabulary, online handwritten paragraphs in any Indian language.