4 resultados para Word Sense Disambiguation
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo
Resumo:
Complex networks have been employed to model many real systems and as a modeling tool in a myriad of applications. In this paper, we use the framework of complex networks to the problem of supervised classification in the word disambiguation task, which consists in deriving a function from the supervised (or labeled) training data of ambiguous words. Traditional supervised data classification takes into account only topological or physical features of the input data. On the other hand, the human (animal) brain performs both low- and high-level orders of learning and it has facility to identify patterns according to the semantic meaning of the input data. In this paper, we apply a hybrid technique which encompasses both types of learning in the field of word sense disambiguation and show that the high-level order of learning can really improve the accuracy rate of the model. This evidence serves to demonstrate that the internal structures formed by the words do present patterns that, generally, cannot be correctly unveiled by only traditional techniques. Finally, we exhibit the behavior of the model for different weights of the low- and high-level classifiers by plotting decision boundaries. This study helps one to better understand the effectiveness of the model. Copyright (C) EPLA, 2012
Resumo:
The automatic disambiguation of word senses (i.e., the identification of which of the meanings is used in a given context for a word that has multiple meanings) is essential for such applications as machine translation and information retrieval, and represents a key step for developing the so-called Semantic Web. Humans disambiguate words in a straightforward fashion, but this does not apply to computers. In this paper we address the problem of Word Sense Disambiguation (WSD) by treating texts as complex networks, and show that word senses can be distinguished upon characterizing the local structure around ambiguous words. Our goal was not to obtain the best possible disambiguation system, but we nevertheless found that in half of the cases our approach outperforms traditional shallow methods. We show that the hierarchical connectivity and clustering of words are usually the most relevant features for WSD. The results reported here shed light on the relationship between semantic and structural parameters of complex networks. They also indicate that when combined with traditional techniques the complex network approach may be useful to enhance the discrimination of senses in large texts. Copyright (C) EPLA, 2012
Resumo:
For these Russian authors, a sign has to be faithful to reality but what is, in fact, «to be faithful», what is «reality»? They suggest that thought structures itself only by means of signs – as Peirce, who denies the reality of dreams saying that the act to feel hunger is an ideological expression and the shouts of a new-born are already appreciative manifestations of this new human being. The authors had inspired the structuralism, saying that a «semiodiscourse» structures men. Although this instance, word remains neutral, assertion strange to their Hegelian and Marxist roots; their paradigm in contrast, can be Heideggerian, according to which, only the «marked» being exists: looking at one determined thing, I place it, I fit it in its context. To place something is to attribute sense and that is more Stoic than, in fact, Marxist.
Resumo:
Sugarcane is an important sugar and energy crop that can be used efficiently for biofuels production. The development of sugarcane cultivars tolerant to drought could allow for the expansion of plantations to sub-prime regions. Knowledge on the mechanisms underlying drought responses and its relationship with carbon partition would greatly help to define routes to increase yield. In this work we studied sugarcane responses to drought using a custom designed oligonucleotide array with 21,901 different probes. The oligoarrays were designed to contain probes that detect transcription in both sense and antisense orientation. We validated the results obtained using quantitative real-time PCR (qPCR). A total of 987 genes were differentially expressed in at least one sample of sugarcane plants submitted to drought for 24, 72 and 120 h. Among them, 928 were sense transcripts and 59 were antisense transcripts. Genes related to Carbohydrate Metabolism, RNA Metabolism and Signal Transduction were selected for gene expression validation by qPCR that indicated a validation percentage of 90 %. From the probes presented on the array, 75 % of the sense probes and 11.9 % of the antisense probes have signal above background and can be classified as expressed sequences. Our custom sugarcane oligonucleotide array provides sensitivity and good coverage of sugarcane transcripts for the identification of a representative proportion of natural antisense transcripts (NATs) and sense-antisense transcript pairs (SATs). The antisense transcriptome showed, in most cases, co-expression with respective sense transcripts.