ScriptLattes: an open-source knowledge extraction system from the Lattes platform


Author(s): MENA-CHALCO, Jesús Pascual; CESAR JUNIOR, Roberto Marcondes
Contributor(s)

UNIVERSIDADE DE SÃO PAULO

Date(s)

26/03/2012

26/03/2012

2009

Abstract

The Lattes platform is the major scientific information system maintained by the National Council for Scientific and Technological Development (CNPq). The platform manages the curricular information of researchers and institutions working in Brazil, based on the so-called Lattes Curriculum. However, the public information is available only individually for each researcher, and the platform does not automatically create reports on the scientific production of research groups. It is thus difficult to extract and summarize useful knowledge for medium to large groups of researchers. This paper describes the design, implementation, and experiences with scriptLattes: an open-source system that creates academic reports for groups based on curricula from the Lattes Database. The scriptLattes system is composed of the following modules: (a) data selection, (b) data preprocessing, (c) redundancy treatment, (d) collaboration graph generation among group members, (e) research map generation based on geographical information, and (f) automatic report creation of bibliographical, technical and artistic production, and academic supervisions. The system has been extensively tested on a large variety of research groups from Brazilian institutions, and the generated reports offer an alternative for easily extracting knowledge from data in the context of the Lattes platform. The source code, usage instructions and examples are available at http://scriptlattes.sourceforge.net/.
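
To make modules (c) and (d) of the abstract more concrete, the following is a minimal illustrative sketch in Python. It is not the scriptLattes implementation: the abstract does not describe how redundancy is detected, so this sketch assumes exact matching on normalized titles. The input layout (a dict from member name to a list of titles), the function names, and the sample data are all hypothetical.

```python
import re
from itertools import combinations


def normalize_title(title):
    """Lowercase and collapse non-alphanumeric runs so near-identical titles compare equal."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def merge_duplicates(curricula):
    """Redundancy treatment (assumed strategy): group entries by normalized title.

    `curricula` is a hypothetical dict: member name -> list of publication titles.
    Returns a dict: normalized title -> set of members who list that publication.
    """
    merged = {}
    for member, titles in curricula.items():
        for title in titles:
            merged.setdefault(normalize_title(title), set()).add(member)
    return merged


def collaboration_graph(merged):
    """Build undirected edges weighted by the number of publications two members share."""
    edges = {}
    for members in merged.values():
        for a, b in combinations(sorted(members), 2):
            edges[(a, b)] = edges.get((a, b), 0) + 1
    return edges


# Hypothetical sample data, not taken from the Lattes platform.
curricula = {
    "Ana": ["Graph Mining in Practice", "A Study of Lattes Data"],
    "Bruno": ["Graph mining in practice.", "Image Segmentation"],
    "Carla": ["A study of Lattes data"],
}

merged = merge_duplicates(curricula)
edges = collaboration_graph(merged)
print(edges)
```

In this sketch a publication listed by two members is counted once, and each shared publication adds one unit of weight to the edge between its co-listing members. A real system would likely need fuzzier matching (for example on year and author lists) than the title normalization assumed here.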

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

CNPq

FAPESP

Identifier

Journal of the Brazilian Computer Society, v.15, n.4, p.31-39, 2009

0104-6500

http://producao.usp.br/handle/BDPI/11957

10.1590/S0104-65002009000400004

http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0104-65002009000400004

http://www.scielo.br/pdf/jbcos/v15n4/04.pdf

Language(s)

eng

Publisher

Sociedade Brasileira de Computação

Relation

Journal of the Brazilian Computer Society

Rights

openAccess

Copyright Sociedade Brasileira de Computação

Keywords

Academic production report; Lattes platform; Knowledge discovery

Type

article

original article

publishedVersion