900 resultados para Natural Language Queries, NLPX, Bricks, XML-IR, Users


Relevância:

100.00% 100.00%

Publicador:

Resumo:

This project proposes a framework that identifies high‐value disaster-based information from social media to facilitate key decision-making processes during natural disasters. At present it is very difficult to differentiate between information that has a high degree of disaster relevance and information that has a low degree of disaster relevance. By digitally harvesting and categorising social media conversation streams automatically, this framework identifies highly disaster-relevant information that can be used by emergency services for intelligence gathering and decision-making.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Recent advances in neural language models have contributed new methods for learning distributed vector representations of words (also called word embeddings). Two such methods are the continuous bag-of-words model and the skipgram model. These methods have been shown to produce embeddings that capture higher order relationships between words that are highly effective in natural language processing tasks involving the use of word similarity and word analogy. Despite these promising results, there has been little analysis of the use of these word embeddings for retrieval. Motivated by these observations, in this paper, we set out to determine how these word embeddings can be used within a retrieval model and what the benefit might be. To this aim, we use neural word embeddings within the well known translation language model for information retrieval. This language model captures implicit semantic relations between the words in queries and those in relevant documents, thus producing more accurate estimations of document relevance. The word embeddings used to estimate neural language models produce translations that differ from previous translation language model approaches; differences that deliver improvements in retrieval effectiveness. The models are robust to choices made in building word embeddings and, even more so, our results show that embeddings do not even need to be produced from the same corpus being used for retrieval.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper introduces the META-NORD project which develops Nordic and Baltic part of the European open language resource infrastructure. META-NORD works on assembling, linking across languages, and making widely available the basic language resources used by developers, professionals and researchers to build specific products and applications. The goals of the project, overall approach and specific focus lines on wordnets, terminology resources and treebanks are described. Moreover, results achieved in first five months of the project, i.e. language whitepapers, metadata specification and IPR, are presented.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Edited by Andrea Abel, Chiara Vettori, Natascia Ralli.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We propose a new formally syntax-based method for statistical machine translation. Transductions between parsing trees are transformed into a problem of sequence tagging, which is then tackled by a search- based structured prediction method. This allows us to automatically acquire transla- tion knowledge from a parallel corpus without the need of complex linguistic parsing. This method can achieve compa- rable results with phrase-based method (like Pharaoh), however, only about ten percent number of translation table is used. Experiments show that the structured pre- diction approach for SMT is promising for its strong ability at combining words.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

在低挡微机中速度较慢的串行处理硬设备条件下,利用本文提出的启发式概念,分层搜索和匹配策略以及设置最大搜索长度等方法,可使推理速度提高一个数量级以上.此外,通过引入语义信息,分阶段消除歧义,自顶向下与自底向上相结合,以及把一般疑问句一律变成相应陈述句的方法,解决了自动英语句法分析中的一系列难题,缩小了知识库的规模。

Relevância:

100.00% 100.00%

Publicador:

Resumo:

本文简要介绍了一个数控自动编程专家系统的自然语言接口的实现.该自然语言接口是以我们研制的数控自动编程专家系统为背景,运行在 SUN3/4 工作站的 UNIX 下和 IBM/AT 机的 DOS 下,用 C语言编程.该自然语言接口由词法分析、句法分析、语义语用分析、目标生成和图形仿真五个模块及相应的知识库构成.该接口能够接受数控编程系统所需的对工件的英语自然语言描述并处理一些比较简单的英语语言现象.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

According to PDP theory, the author tries to use ANN method in sentence understanding. In input layer, distributed knowledge representation and integrate syntactic, semantic information (of the word in Chinese sentence) and context information are used to complete the case role assignment of six types of Chinese sentence by parallel processing. The model is a four-layer forward network, consisting of input layer, two hidden layers, and output layer(case role layer). In addition, the neural network method and the traditional symbol processing method used in natural language understanding is compared and analyzed, and a conclusion could be made: the neural network should be used as a powerful tool in this area.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In Phys. Rev. Letters (73:2), Mantegna et al. conclude on the basis of Zipf rank frequency data that noncoding DNA sequence regions are more like natural languages than coding regions. We argue on the contrary that an empirical fit to Zipf"s "law" cannot be used as a criterion for similarity to natural languages. Although DNA is a presumably "organized system of signs" in Mandelbrot"s (1961) sense, and observation of statistical featurs of the sort presented in the Mantegna et al. paper does not shed light on the similarity between DNA's "gramar" and natural language grammars, just as the observation of exact Zipf-like behavior cannot distinguish between the underlying processes of tossing an M-sided die or a finite-state branching process.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This thesis proposes a computational model of how children may come to learn the meanings of words in their native language. The proposed model is divided into two separate components. One component produces semantic descriptions of visually observed events while the other correlates those descriptions with co-occurring descriptions of those events in natural language. The first part of this thesis describes three implementations of the correlation process whereby representations of the meanings of whole utterances can be decomposed into fragments assigned as representations of the meanings of individual words. The second part of this thesis describes an implemented computer program that recognizes the occurrence of simple spatial motion events in simulated video input.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In previous papers, we have presented a logic-based framework based on fusion rules for merging structured news reports. Structured news reports are XML documents, where the textentries are restricted to individual words or simple phrases, such as names and domain-specific terminology, and numbers and units. We assume structured news reports do not require natural language processing. Fusion rules are a form of scripting language that define how structured news reports should be merged. The antecedent of a fusion rule is a call to investigate the information in the structured news reports and the background knowledge, and the consequent of a fusion rule is a formula specifying an action to be undertaken to form a merged report. It is expected that a set of fusion rules is defined for any given application. In this paper we extend the approach to handling probability values, degrees of beliefs, or necessity measures associated with textentries in the news reports. We present the formal definition for each of these types of uncertainty and explain how they can be handled using fusion rules. We also discuss the methods of detecting inconsistencies among sources.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Tese de doutoramento, Informática (Ciências da Computação), Universidade de Lisboa, Faculdade de Ciências, 2014