2 resultados para Portuguese fiction

em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)


Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper, first result of a larger research, proposes a query about some aspects of social representation of libraries and librarians, as they appear in literary and cinematographic productions. Little by little, this query, which arose from purposes of organizing catalogues, revealed elements that established different series, in which the narrative genre (literary or cinematographic) has no relevance to either libraries or librarians` representations. The presence of these elements seems to show some expectations and utopias in relation to the common knowledge, independently from narratives being located in the past, in the present or in the future, stimulating reflection on some medieval and baroque traditions about the library universe and its main characters, the librarians. The cinematographic material selected for research was The time machine, Farenheit 451, The day after tomorrow, Star Wars - episode II and the novels Martin Eden, The man without qualities, The time machine and La sombra del viento.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The amount of textual information digitally stored is growing every day. However, our capability of processing and analyzing that information is not growing at the same pace. To overcome this limitation, it is important to develop semiautomatic processes to extract relevant knowledge from textual information, such as the text mining process. One of the main and most expensive stages of the text mining process is the text pre-processing stage, where the unstructured text should be transformed to structured format such as an attribute-value table. The stemming process, i.e. linguistics normalization, is usually used to find the attributes of this table. However, the stemming process is strongly dependent on the language in which the original textual information is given. Furthermore, for most languages, the stemming algorithms proposed in the literature are computationally expensive. In this work, several improvements of the well know Porter stemming algorithm for the Portuguese language, which explore the characteristics of this language, are proposed. Experimental results show that the proposed algorithm executes in far less time without affecting the quality of the generated stems.