877 resultados para Textual genres
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Pós-graduação em Educação - FFC
Resumo:
Pós-graduação em Letras - FCLAR
Resumo:
The importance of the new textual genres such as blogs or forum entries is growing in parallel with the evolution of the Social Web. This paper presents two corpora of blog posts in English and in Spanish, annotated according to the EmotiBlog annotation scheme. Furthermore, we created 20 factual and opinionated questions for each language and also the Gold Standard for their answers in the corpus. The purpose of our work is to study the challenges involved in a mixed fact and opinion question answering setting by comparing the performance of two Question Answering (QA) systems as far as mixed opinion and factual setting is concerned. The first one is open domain, while the second one is opinion-oriented. We evaluate separately the two systems in both languages and propose possible solutions to improve QA systems that have to process mixed questions.
Resumo:
The development of the Web 2.0 led to the birth of new textual genres such as blogs, reviews or forum entries. The increasing number of such texts and the highly diverse topics they discuss make blogs a rich source for analysis. This paper presents a comparative study on open domain and opinion QA systems. A collection of opinion and mixed fact-opinion questions in English is defined and two Question Answering systems are employed to retrieve the answers to these queries. The first one is generic, while the second is specific for emotions. We comparatively evaluate and analyze the systems’ results, concluding that opinion Question Answering requires the use of specific resources and methods.
Resumo:
The exponential growth of the subjective information in the framework of the Web 2.0 has led to the need to create Natural Language Processing tools able to analyse and process such data for multiple practical applications. They require training on specifically annotated corpora, whose level of detail must be fine enough to capture the phenomena involved. This paper presents EmotiBlog – a fine-grained annotation scheme for subjectivity. We show the manner in which it is built and demonstrate the benefits it brings to the systems using it for training, through the experiments we carried out on opinion mining and emotion detection. We employ corpora of different textual genres –a set of annotated reported speech extracted from news articles, the set of news titles annotated with polarity and emotion from the SemEval 2007 (Task 14) and ISEAR, a corpus of real-life self-expressed emotion. We also show how the model built from the EmotiBlog annotations can be enhanced with external resources. The results demonstrate that EmotiBlog, through its structure and annotation paradigm, offers high quality training data for systems dealing both with opinion mining, as well as emotion detection.
Resumo:
EmotiBlog is a corpus labelled with the homonymous annotation schema designed for detecting subjectivity in the new textual genres. Preliminary research demonstrated its relevance as a Machine Learning resource to detect opinionated data. In this paper we compare EmotiBlog with the JRC corpus in order to check the EmotiBlog robustness of annotation. For this research we concentrate on its coarse-grained labels. We carry out a deep ML experimentation also with the inclusion of lexical resources. The results obtained show a similarity with the ones obtained with the JRC demonstrating the EmotiBlog validity as a resource for the SA task.
Resumo:
I propose a method to study interactional ironic humorous utterances in Spanish. In GRIALE research group consider this method can be applied to humorous ironic utterances in different textual genres, from the violation of conversational principles. Futhermore, we present the General Theory of Verbal Humor proposed by Attardo that it will be taken in our analysis. Therefore, I study irony and humor in examples of conversations from Peninsular Spanish real sample corpuses (COVJA, Corpus de conversaciones coloquiales [Corpus of Colloquial Conversations] and CREA, Corpus de Referencia del Español Actual [Reference Corpus of Present-Day Spanish]). In this article, I will focus on the application of this theory to humorous ironic statements which arise in conversation and examine the effects caused by them, which will additionally verify if irony and humor coexist in the same conversational exchange with a communicative aim and conversational strategies.
Resumo:
Researchers from the GRIALE group (Irony and Humour Research Group) have developed a theoretical method that can be applied to humorous ironic utterances in different textual genres, depending on the degree of the violation of conversational principles in conversation. In addition to this, the General Theory of Verbal Humor (Attardo and Raskin, 1991) will be taken into account in the analysis. Therefore, I will study irony and humour in conversational utterances in real examples of Peninsular Spanish obtained from the COVJA, (Corpus de conversaciones coloquiales [Corpus of Colloquial Conversations]) and CREA, (Corpus de Referencia del Español Actual [Reference Corpus of Present-Day Spanish]). The focus of this paper is then the application of the aforementioned theories to humorous ironic statements which arise in conversation. I will also examine the positive or negative effects caused by them, which will additionally verify if irony and humour coexist in the same conversational exchange, and if this has a communicative goal.
Resumo:
Domain specific information retrieval has become in demand. Not only domain experts, but also average non-expert users are interested in searching domain specific (e.g., medical and health) information from online resources. However, a typical problem to average users is that the search results are always a mixture of documents with different levels of readability. Non-expert users may want to see documents with higher readability on the top of the list. Consequently the search results need to be re-ranked in a descending order of readability. It is often not practical for domain experts to manually label the readability of documents for large databases. Computational models of readability needs to be investigated. However, traditional readability formulas are designed for general purpose text and insufficient to deal with technical materials for domain specific information retrieval. More advanced algorithms such as textual coherence model are computationally expensive for re-ranking a large number of retrieved documents. In this paper, we propose an effective and computationally tractable concept-based model of text readability. In addition to textual genres of a document, our model also takes into account domain specific knowledge, i.e., how the domain-specific concepts contained in the document affect the document’s readability. Three major readability formulas are proposed and applied to health and medical information retrieval. Experimental results show that our proposed readability formulas lead to remarkable improvements in terms of correlation with users’ readability ratings over four traditional readability measures.
Resumo:
The study aims to analyze the crime of the advertising process in the post-World War II period in Brazil, considering the Tribuna do Norte newspaper as one of the main vectors of this production in the public sphere of Rio Grande do Norte. The theoretical discussion is based on sociologists Jürgen Habermas and John Thompson, among others, that bring ideas about the relationship between the press and the public space. Our research in the journal is during the period from 1950, the year of the creation of this press, to 1970, in the context of AI-5 law. This period is considered the consolidation of this periodic in the populist context of Aluízio Alves, as well as the articulation with political changes after and before military coup in 1964. The publicity of crime is showed as a historical building, involving journalistic procedures, subjects and spaces. The publicity is related to commercial and political questions when some facts turned into a public event. In this sense, this research focuses on the publicity in its political dimensions. Related to the methodology, it is an empirical and qualitative study, based on literature, with a descriptive and interpretative approach, according to historian Tânia de Luca. The corpus of analyze is composed by notes, titles, news, reports, advertisements, image texts, among another textual genres. The chapters present a study about the building and changes of the populist journalism; the publicity of crime in democratic times; besides the military coup in 1964 and the changes of publicity of crime. The results of analyzes show that Tribuna do Norte, although has adopted more liberal pattern from North American presses, during the analyzed period has yet conservative and authoritative patterns from old potiguar presses. In this period, the political practice, in spite of diverse commercial interests, was an important element in the trajectory of this ambiguous journalism that has influencing, in a significant way, the production of news of crime.
Resumo:
The Web 2.0 has resulted in a shift as to how users consume and interact with the information, and has introduced a wide range of new textual genres, such as reviews or microblogs, through which users communicate, exchange, and share opinions. The exploitation of all this user-generated content is of great value both for users and companies, in order to assist them in their decision-making processes. Given this context, the analysis and development of automatic methods that can help manage online information in a quicker manner are needed. Therefore, this article proposes and evaluates a novel concept-level approach for ultra-concise opinion abstractive summarization. Our approach is characterized by the integration of syntactic sentence simplification, sentence regeneration and internal concept representation into the summarization process, thus being able to generate abstractive summaries, which is one the most challenging issues for this task. In order to be able to analyze different settings for our approach, the use of the sentence regeneration module was made optional, leading to two different versions of the system (one with sentence regeneration and one without). For testing them, a corpus of 400 English texts, gathered from reviews and tweets belonging to two different domains, was used. Although both versions were shown to be reliable methods for generating this type of summaries, the results obtained indicate that the version without sentence regeneration yielded to better results, improving the results of a number of state-of-the-art systems by 9%, whereas the version with sentence regeneration proved to be more robust to noisy data.
Resumo:
The use of new technologies in classroom, including in foreign language learning, is becoming more and more frequent. The researches already show investigations with the use of emails, chats, blogs, hypertexts, facebook in foreign language classroom especially to develop the students’ written ability. However, there is still a lack of studies concerning educational blogs in foreign language as a digital genre. This work aim characterizing educational blog as a genre under Bakhtinian perspective. To achieve these objectives, the methodology used is characterized as exploratory-descriptive with quali-quantitative base whose collected data consist in analysis of seven educational blogs. The data analysis was done using the theoretical assumptions about textual genres under Bakhtinian perspective. The main results indicate that the analyzed blogs have thematic content, composition and style that are characteristical as digital genres. In conclusion of this work, suggestions are made for future researches into the use of blogs in the foreign language classroom.
Resumo:
The focus on the study of word classes from a morphosyntactic-semantic approach seems to have been somewhat neglected when compared to the attention given to textual genres. We do not say that different textual genres should not be considered, on the contrary, they should be studied, but it is also necessary to remember the importance of word classes in terms of their function as cohesive, modal, hyperonym, argumentative, substitution and reference elements among various other functions in reading and in text production. This paper presents a reflection about the importance of a theoretical view for the production and comprehension of various text genres.