1000 results for Semistructured documents


Relevance:

100.00% 100.00%

Publisher:

Abstract:

The increasing amount of available semistructured data demands efficient mechanisms to store, process, and search an enormous corpus of data to encourage its global adoption. Current techniques to store semistructured documents either map them to relational databases or use a combination of flat files and indexes. Both approaches result in a mismatch between the tree structure of semistructured data and the access characteristics of the underlying storage devices. Furthermore, the inefficiency of XML parsing methods has slowed the large-scale adoption of XML in actual system implementations. The recent development of lazy parsing techniques is a major step towards improving this situation, but lazy parsers still have significant drawbacks that undermine the massive adoption of XML. Once the processing (storage and parsing) issues for semistructured data have been addressed, another key challenge in leveraging semistructured data is to perform effective information discovery on such data. Previous work has addressed this problem in a generic (i.e., domain-independent) way, but the process can be improved if knowledge about the specific domain is taken into consideration. This dissertation had two general goals. The first goal was to devise novel techniques to efficiently store and process semistructured documents. This goal had two specific aims: we proposed a method for storing semistructured documents that maps the physical characteristics of the documents to the geometrical layout of hard drives, and we developed a Double-Lazy Parser for semistructured documents, which introduces lazy behavior in both the pre-parsing and progressive parsing phases of the standard Document Object Model's parsing mechanism. The second goal was to construct a user-friendly and efficient engine for performing Information Discovery over domain-specific semistructured documents. This goal also had two aims: we presented a framework that exploits domain-specific knowledge to improve the quality of the information discovery process by incorporating domain ontologies, and we proposed meaningful evaluation metrics to compare the results of search systems over semistructured documents.
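The general idea of lazy (deferred) parsing mentioned in the abstract above can be illustrated with a minimal Python sketch. This is not the dissertation's Double-Lazy Parser, whose design is not described here; it only shows the basic principle of postponing the parsing cost until the document is actually accessed. The class name `LazyDocument`, the `parse_count` counter, and the sample XML are hypothetical and exist only for illustration.

```python
import xml.etree.ElementTree as ET


class LazyDocument:
    """Hypothetical wrapper: keeps the raw XML text and parses it on first access."""

    def __init__(self, raw_xml: str) -> None:
        self._raw = raw_xml
        self._root = None      # no tree has been built yet
        self.parse_count = 0   # illustration only: counts how often we parse

    @property
    def root(self) -> ET.Element:
        # Parse only when the tree is first needed, then reuse the result.
        if self._root is None:
            self._root = ET.fromstring(self._raw)
            self.parse_count += 1
        return self._root


def titles(doc: LazyDocument) -> list:
    """Collect the text of every <title> element in the document."""
    return [el.text for el in doc.root.iter("title")]


# Made-up sample document for the sketch.
doc = LazyDocument(
    "<library>"
    "<book><title>A</title></book>"
    "<book><title>B</title></book>"
    "</library>"
)
```

Creating `doc` does no parsing; the first call to `titles(doc)` triggers a single parse, and later calls reuse the cached tree. A real lazy DOM parser would defer work at a finer granularity (per subtree rather than per document), which this sketch does not attempt.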

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Universidade Estadual de Campinas. Faculdade de Educação Física

Relevance:

20.00% 20.00%

Publisher:

Abstract:

In a context of expanding participatory public spaces in Brazil, there are expectations of awakening sociopolitical values in university students during their civic and professional training, in light of criticisms of how administrators are educated. This work therefore seeks to understand the dynamics of political consciousness among undergraduate administration students at a federal public university in southeastern Brazil, in relation to their citizen participation in participatory public spaces at the state and municipal levels. The theoretical framework is Sandoval's (2001) analytical model of political consciousness for understanding participation in collective action, combined with the literature on citizen participation. This is a qualitative study whose data were collected from documents, 30 questionnaires, and 17 semi-structured interviews, involving 30 undergraduate administration students enrolled in 2014/1. The data were subjected to content analysis (BARDIN, 2004). The results show 12 students who do not participate in participatory public spaces and 18 students who participate in at least one of them. Those who participate cite as reasons an interest in exercising citizenship, improving public policies, enjoying involvement in public affairs, and defending their interests in situations of conflict. Among the students with more active participation, societal beliefs, values, and expectations are evident, linked to political efficacy, collective identity, antagonistic interests, and feelings of justice and injustice, which favor the will to act collectively because the students perceive a connection between their interests and the goals and collective actions of the movements in which they are involved. The students who do not participate distrust participatory public spaces and show a lack of interest in public affairs, although they report discomfort at not participating. Their societal beliefs, values, and expectations, combined with feelings of political inefficacy, hinder the development of political consciousness. The study concludes that these students have a common-sense political consciousness, displaying social and political values inherent to the fads present in people's everyday lives. The students with more active participation, in contrast, show a political consciousness of conflict, which motivates them to participate in the spaces they judge effective for their proposals. However, the Centro Acadêmico Livre de Administração Honestino Guimarães (CALAD), the main venue for representing students' interests and for their participation in the program, currently has no leadership and does not take part in the university's institutionalized bodies.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Integrated master's dissertation in Engineering and Management of Information Systems

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Much of the information in historical documents about territory and property is recorded in textual form. This information is mostly geographic and defines territorial areas, their limits, and their boundaries. To handle this data, we defined an information system in which the treatment of documentary references for the study of settlement and landscape involves the systematization, normalization, and integration of the information, as well as its graphic and cartographic representation. This methodology was applied to the case study of the boundary of the monastery-diocese of Dume, in Braga, Portugal, a site for which there are countless documents and references, but where urban pressure has significantly altered the landscape, making the identification of territorial limits quite difficult. The work carried out to give spatial and cartographic expression to the data, by defining viewing criteria according to the recorded information, proved to be a central working tool in the boundary study and in understanding the dynamics of the sites across the various cultural periods.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Magdeburg, Univ., Fak. für Informatik, Diss., 2010