975 resultados para semantic content annotation
Resumo:
In French, a causal relation is often conveyed by the connectives car, parce que or puisque. Since the seminal work of the Lambda-l Group (1975), it has generally been assumed that parce que, used to relate semantic content, contrasts with car and puisque, both used to connect either speech act or epistemic content. However, this analysis leaves a number of questions unanswered. In this paper, I present a reanalysis of this trio, using empirical methods such as corpus analysis and constrained elicitation. Results indicate that car and parce que are interchangeable in many contexts, even if they are still prototypically used in their respective domain in writing. As for puisque, its distribution does not overlap with car, despite their similar domains of use. I argue that the specificity of puisque with respect to the other two connectives is to introduce a cause with an echoic meaning.
USO DE TEORIAS NO CAMPO DE SISTEMAS DE INFORMAÇÃO: MAPEAMENTO USANDO TÉCNICAS DE MINERAÇÃO DE TEXTOS
Resumo:
Esta dissertação visa apresentar o mapeamento do uso das teorias de sistemas de informações, usando técnicas de recuperação de informação e metodologias de mineração de dados e textos. As teorias abordadas foram Economia de Custos de Transações (Transactions Costs Economics TCE), Visão Baseada em Recursos da Firma (Resource-Based View-RBV) e Teoria Institucional (Institutional Theory-IT), sendo escolhidas por serem teorias de grande relevância para estudos de alocação de investimentos e implementação em sistemas de informação, tendo como base de dados o conteúdo textual (em inglês) do resumo e da revisão teórica dos artigos dos periódicos Information System Research (ISR), Management Information Systems Quarterly (MISQ) e Journal of Management Information Systems (JMIS) no período de 2000 a 2008. Os resultados advindos da técnica de mineração textual aliada à mineração de dados foram comparadas com a ferramenta de busca avançada EBSCO e demonstraram uma eficiência maior na identificação de conteúdo. Os artigos fundamentados nas três teorias representaram 10% do total de artigos dos três períodicos e o período mais profícuo de publicação foi o de 2001 e 2007.(AU)
USO DE TEORIAS NO CAMPO DE SISTEMAS DE INFORMAÇÃO: MAPEAMENTO USANDO TÉCNICAS DE MINERAÇÃO DE TEXTOS
Resumo:
Esta dissertação visa apresentar o mapeamento do uso das teorias de sistemas de informações, usando técnicas de recuperação de informação e metodologias de mineração de dados e textos. As teorias abordadas foram Economia de Custos de Transações (Transactions Costs Economics TCE), Visão Baseada em Recursos da Firma (Resource-Based View-RBV) e Teoria Institucional (Institutional Theory-IT), sendo escolhidas por serem teorias de grande relevância para estudos de alocação de investimentos e implementação em sistemas de informação, tendo como base de dados o conteúdo textual (em inglês) do resumo e da revisão teórica dos artigos dos periódicos Information System Research (ISR), Management Information Systems Quarterly (MISQ) e Journal of Management Information Systems (JMIS) no período de 2000 a 2008. Os resultados advindos da técnica de mineração textual aliada à mineração de dados foram comparadas com a ferramenta de busca avançada EBSCO e demonstraram uma eficiência maior na identificação de conteúdo. Os artigos fundamentados nas três teorias representaram 10% do total de artigos dos três períodicos e o período mais profícuo de publicação foi o de 2001 e 2007.(AU)
Resumo:
Pouca atenção tem merecido o estudo dos deveres instrumentais tributários pelos estudiosos do direito tributário em nosso país, com a preocupação de conferir contornos nítidos ao regime jurídico dos deveres instrumentais dentro do sistema tributário brasileiro e, em especial, de examinar a quais limitações está adstrita a Administração Pública na imposição desses deveres. O presente trabalho visa tentar suprir, em alguma medida, essa lacuna, promovendo uma análise das limitações à imposição de deveres instrumentais tributários, que leve em consideração, não apenas os princípios que conformam seu regime jurídico, mas, principalmente, a existência de regras objetivas disciplinando o tema, partindo-se da premissa de que, genericamente, dicções principiológicas, por sua abstração, não são suficientes para a adequada regulação das condutas intersubjetivas, seja entre particulares, seja entre estes e o Poder Público. Merecerá especial atenção a regra inserta no art. 113, §2º do Código Tributário Nacional, de forte vocação limitadora, especificamente no que tange à investigação do conteúdo semântico da expressão interesse da arrecadação ou da fiscalização dos tributos, que, a nosso ver, constitui a pedra-de-toque do regime jurídico dos deveres instrumentais e das sanções punitivas impostas em virtude de seu descumprimento. Por fim, buscar-se-á conferir a devida importância aos custos de conformidades e demonstrar que seu estudo é relevante para o sistema tributário, na medida em que tais custos, enquanto efeito econômico da imposição de deveres instrumentais, implicam efeitos relevantes no âmbito jurídico, inclusive restrições no âmbito de proteção de direitos fundamentais dos contribuintes.
Resumo:
El presente artículo tiene como finalidad, valorar el impacto ambiental de las conducciones del Canal de Isabel II en el contexto del paisaje. Partimos de la idea según la cual, las infraestructuras del Canal de Isabel II, y más que formar parte del paisaje por el que se extienden, son el propio paisaje. Nuestra zona de estudio es el noroeste de la Comunidad de Madrid (síntesis de la interacción de los propios agentes naturales, de la ocupación humana y de los usos del suelo), área a la que nos aproximarnos a través de la investigación de la integración paisajística, entendida ésta como una estrategia de intervención en el territorio, que tiene como objetivo principal orientar las transformaciones del paisaje o corregir las ya realizadas, para conseguir su adaptación al propio paisaje. En definitiva, nos encontramos ante la necesidad de ajustar un objeto o actuación territorial a las características fisonómicas de un paisaje dado, o de algunos de sus componentes, así como a su carácter y a sus contenidos semánticos.
Resumo:
Basic grammatical categories may carry social meaning irrespective of their semantic content. In a set of four studies, we demonstrate that verbs – a basic linguistic category present and distinguishable in most languages – are related to the perception of agency, a fundamental dimension in social perception. In an archival analysis on actual language use in Polish and German, we found that targets stereotypically associated with high agency (men and young people) are presented in the immediate neighborhood of a verb more often than non-agentic social targets (women and old people). Moreover, in three experiments using a pseudo-word paradigm, verbs (but not adjectives and nouns) were consistently associated with agency (but not communion). These results provide consistent evidence that verbs, as grammatical vehicles of action, are linguistic markers of agency. In demonstrating meta-semantic effects of language, these studies corroborate the view of language as a social tool and of language as an integral part of social perception.
USO DE TEORIAS NO CAMPO DE SISTEMAS DE INFORMAÇÃO: MAPEAMENTO USANDO TÉCNICAS DE MINERAÇÃO DE TEXTOS
Resumo:
Esta dissertação visa apresentar o mapeamento do uso das teorias de sistemas de informações, usando técnicas de recuperação de informação e metodologias de mineração de dados e textos. As teorias abordadas foram Economia de Custos de Transações (Transactions Costs Economics TCE), Visão Baseada em Recursos da Firma (Resource-Based View-RBV) e Teoria Institucional (Institutional Theory-IT), sendo escolhidas por serem teorias de grande relevância para estudos de alocação de investimentos e implementação em sistemas de informação, tendo como base de dados o conteúdo textual (em inglês) do resumo e da revisão teórica dos artigos dos periódicos Information System Research (ISR), Management Information Systems Quarterly (MISQ) e Journal of Management Information Systems (JMIS) no período de 2000 a 2008. Os resultados advindos da técnica de mineração textual aliada à mineração de dados foram comparadas com a ferramenta de busca avançada EBSCO e demonstraram uma eficiência maior na identificação de conteúdo. Os artigos fundamentados nas três teorias representaram 10% do total de artigos dos três períodicos e o período mais profícuo de publicação foi o de 2001 e 2007.(AU)
Resumo:
Previous research into formulaic language has focussed on specialised groups of people (e.g. L1 acquisition by infants and adult L2 acquisition) with ordinary adult native speakers of English receiving less attention. Additionally, whilst some features of formulaic language have been used as evidence of authorship (e.g. the Unabomber’s use of you can’t eat your cake and have it too) there has been no systematic investigation into this as a potential marker of authorship. This thesis reports the first full-scale study into the use of formulaic sequences by individual authors. The theory of formulaic language hypothesises that formulaic sequences contained in the mental lexicon are shaped by experience combined with what each individual has found to be communicatively effective. Each author’s repertoire of formulaic sequences should therefore differ. To test this assertion, three automated approaches to the identification of formulaic sequences are tested on a specially constructed corpus containing 100 short narratives. The first approach explores a limited subset of formulaic sequences using recurrence across a series of texts as the criterion for identification. The second approach focuses on a word which frequently occurs as part of formulaic sequences and also investigates alternative non-formulaic realisations of the same semantic content. Finally, a reference list approach is used. Whilst claiming authority for any reference list can be difficult, the proposed method utilises internet examples derived from lists prepared by others, a procedure which, it is argued, is akin to asking large groups of judges to reach consensus about what is formulaic. The empirical evidence supports the notion that formulaic sequences have potential as a marker of authorship since in some cases a Questioned Document was correctly attributed. Although this marker of authorship is not universally applicable, it does promise to become a viable new tool in the forensic linguist’s tool-kit.
Resumo:
(ITA) L’industria mondiale odierna nel campo dell’architettura e dell’ingegneria si esprime quasi esclusivamente mediante l’approccio BIM, Building Information Modeling. Anche se sviluppato pensando alle nuove costruzioni ed ancora in via di perfezionamento, è entrato prepotentemente nei capitoli normativi di molti stati all’urlo dell“interoperability”. Su questo tema è recente l’interesse e la possibilità di adozione per l’intervento sul costruito, ovvero di Existing Building Information Modelling, eBIM. Gli studi applicativi-sperimentali in questo ambito sono sempre più numerosi e convergono, purtroppo, sulla delicata correlazione tra la gestione del contenuto semantico e la perdita di interoperabilità. Questa tesi si incentra sull’analisi di tale correlazione valutando in particolare l’aspetto metodologico-applicativo dell’arricchimento semantico adottando come caso studio la Torre Nord della Rocca Estense di San Felice sul Panaro. (ENG)Today's global industry in architecture and engineering fields, expresses itself almost entirely focusing on BIM, Building Information Modeling. Even though it was developed taking in consideration new buildings and the ones that are in the process of improvement, it has entered the regulatory chapters of many states in the hymn of "interoperability". Concerning this topic is recent the interest and possibility of adopting a process to intervene on the already built constructions, Existing Building Information Modeling, eBIM. Application-experimental studies in this area are increasingly numerous and unfortunately converge, on the delicate correlation between the management of the semantic content and the loss of interoperability. This thesis focuses on the analysis of this correlation by evaluating in particular the methodological-applicative aspect of semantic enrichment by adopting the North Tower of the Rocca Estense in San Felice sul Panaro as a case study.
Resumo:
With the advent of high-performance computing devices, deep neural networks have gained a lot of popularity in solving many Natural Language Processing tasks. However, they are also vulnerable to adversarial attacks, which are able to modify the input text in order to mislead the target model. Adversarial attacks are a serious threat to the security of deep neural networks, and they can be used to craft adversarial examples that steer the model towards a wrong decision. In this dissertation, we propose SynBA, a novel contextualized synonym-based adversarial attack for text classification. SynBA is based on the idea of replacing words in the input text with their synonyms, which are selected according to the context of the sentence. We show that SynBA successfully generates adversarial examples that are able to fool the target model with a high success rate. We demonstrate three advantages of this proposed approach: (1) effective - it outperforms state-of-the-art attacks by semantic similarity and perturbation rate, (2) utility-preserving - it preserves semantic content, grammaticality, and correct types classified by humans, and (3) efficient - it performs attacks faster than other methods.
Resumo:
While much of a company's knowledge can be found in text repositories, current content management systems have limited capabilities for structuring and interpreting documents. In the emerging Semantic Web, search, interpretation and aggregation can be addressed by ontology-based semantic mark-up. In this paper, we examine semantic annotation, identify a number of requirements, and review the current generation of semantic annotation systems. This analysis shows that, while there is still some way to go before semantic annotation tools will be able to address fully all the knowledge management needs, research in the area is active and making good progress.
Resumo:
Personal memories composed of digital pictures are very popular at the moment. To retrieve these media items annotation is required. During the last years, several approaches have been proposed in order to overcome the image annotation problem. This paper presents our proposals to address this problem. Automatic and semi-automatic learning methods for semantic concepts are presented. The automatic method is based on semantic concepts estimated using visual content, context metadata and audio information. The semi-automatic method is based on results provided by a computer game. The paper describes our proposals and presents their evaluations.
Resumo:
Dissertação para obtenção do Grau de Mestre em Engenharia Informática
Resumo:
A newspaper content management system has to deal with a very heterogeneous information space as the experience in the Diari Segre newspaper has shown us. The greatest problem is to harmonise the different ways the involved users (journalist, archivists...) structure the newspaper information space, i.e. news, topics, headlines, etc. Our approach is based on ontology and differentiated universes of discourse (UoD). Users interact with the system and, from this interaction, integration rules are derived. These rules are based on Description Logic ontological relations for subsumption and equivalence. They relate the different UoD and produce a shared conceptualisation of the newspaper information domain.
Resumo:
Article About the Authors Metrics Comments Related Content Abstract Introduction Functionality Implementation Discussion Acknowledgments Author Contributions References Reader Comments (0) Figures Abstract Despite of the variety of available Web services registries specially aimed at Life Sciences, their scope is usually restricted to a limited set of well-defined types of services. While dedicated registries are generally tied to a particular format, general-purpose ones are more adherent to standards and usually rely on Web Service Definition Language (WSDL). Although WSDL is quite flexible to support common Web services types, its lack of semantic expressiveness led to various initiatives to describe Web services via ontology languages. Nevertheless, WSDL 2.0 descriptions gained a standard representation based on Web Ontology Language (OWL). BioSWR is a novel Web services registry that provides standard Resource Description Framework (RDF) based Web services descriptions along with the traditional WSDL based ones. The registry provides Web-based interface for Web services registration, querying and annotation, and is also accessible programmatically via Representational State Transfer (REST) API or using a SPARQL Protocol and RDF Query Language. BioSWR server is located at http://inb.bsc.es/BioSWR/and its code is available at https://sourceforge.net/projects/bioswr/under the LGPL license.