The inadequacy of embedded markup for cultural heritage texts


Autoria(s): Schmidt, Desmond Allan
Data(s)

01/09/2010

Resumo

Embedded generalized markup, as applied by digital humanists to the recording and studying of our textual cultural heritage, suffers from a number of serious technical drawbacks. As a result of its evolution from early printer control languages, generalized markup can only express a document’s ‘logical’ structure via a repertoire of permissible printed format structures. In addition to the well-researched overlap problem, the embedding of markup codes into texts that never had them when written leads to a number of further difficulties: the inclusion of potentially obsolescent technical and subjective information into texts that are supposed to be archivable for the long term, the manual encoding of information that could be better computed automatically, and the obscuring of the text by highly complex technical data. Many of these problems can be alleviated by asserting a separation between the versions of which many cultural heritage texts are composed, and their content. In this way the complex inter-connections between versions can be handled automatically, leaving only simple markup for individual versions to be handled by the user.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/38436/

Publicador

Oxford University Press

Relação

http://eprints.qut.edu.au/38436/1/c38436.pdf

DOI:10.1093/llc/fqq007

Schmidt, Desmond Allan (2010) The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.

Direitos

Copyright 2010 Oxford University Press

Fonte

Information Security Institute

Palavras-Chave #089900 OTHER INFORMATION AND COMPUTING SCIENCES #markup #cultural heritage
Tipo

Journal Article