967 resultados para Natural language techniques, Semantic spaces, Random projection, Documents


Relevância:

40.00% 40.00%

Publicador:

Resumo:

This proposal is a non-quantitative study based on a corpus of real data which offers a principled account of the translation strategies employed in the translation of English film titles into Spanish in terms of cognitive modeling. More specifically, we draw on Ruiz de Mendoza and Galera’s (2014) work on what they term content (or low-level) cognitive operations, based on either ‘stands for’ or ‘identity’ relations, in order to investigate possible motivating factors for translations which abide by oblique procedures, i.e. for non-literal renderings of source titles. The present proposal is made in consonance with recent findings within the framework of Cognitive Linguistics (Samaniego 2007), which evidence that this linguistic approach can fruitfully address some relevant issues in Translation Studies, the most outstanding for our purposes being the exploration of the cognitive operations which account for the use of translation strategies (Rojo and Ibarretxe-Antuñano 2013: 10), mainly expansion and reduction operations, parameterization, echoing, mitigation and comparison by contrast. This fits in nicely with a descriptive approach to translation and particularly with skopos theory, whose main aim consists in achieving functionally adequate renderings of source texts.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Secure Multi-party Computation (MPC) enables a set of parties to collaboratively compute, using cryptographic protocols, a function over their private data in a way that the participants do not see each other's data, they only see the final output. Typical MPC examples include statistical computations over joint private data, private set intersection, and auctions. While these applications are examples of monolithic MPC, richer MPC applications move between "normal" (i.e., per-party local) and "secure" (i.e., joint, multi-party secure) modes repeatedly, resulting overall in mixed-mode computations. For example, we might use MPC to implement the role of the dealer in a game of mental poker -- the game will be divided into rounds of local decision-making (e.g. bidding) and joint interaction (e.g. dealing). Mixed-mode computations are also used to improve performance over monolithic secure computations. Starting with the Fairplay project, several MPC frameworks have been proposed in the last decade to help programmers write MPC applications in a high-level language, while the toolchain manages the low-level details. However, these frameworks are either not expressive enough to allow writing mixed-mode applications or lack formal specification, and reasoning capabilities, thereby diminishing the parties' trust in such tools, and the programs written using them. Furthermore, none of the frameworks provides a verified toolchain to run the MPC programs, leaving the potential of security holes that can compromise the privacy of parties' data. This dissertation presents language-based techniques to make MPC more practical and trustworthy. First, it presents the design and implementation of a new MPC Domain Specific Language, called Wysteria, for writing rich mixed-mode MPC applications. Wysteria provides several benefits over previous languages, including a conceptual single thread of control, generic support for more than two parties, high-level abstractions for secret shares, and a fully formalized type system and operational semantics. Using Wysteria, we have implemented several MPC applications, including, for the first time, a card dealing application. The dissertation next presents Wys*, an embedding of Wysteria in F*, a full-featured verification oriented programming language. Wys* improves on Wysteria along three lines: (a) It enables programmers to formally verify the correctness and security properties of their programs. As far as we know, Wys* is the first language to provide verification capabilities for MPC programs. (b) It provides a partially verified toolchain to run MPC programs, and finally (c) It enables the MPC programs to use, with no extra effort, standard language constructs from the host language F*, thereby making it more usable and scalable. Finally, the dissertation develops static analyses that help optimize monolithic MPC programs into mixed-mode MPC programs, while providing similar privacy guarantees as the monolithic versions.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Conventional web search engines are centralised in that a single entity crawls and indexes the documents selected for future retrieval, and the relevance models used to determine which documents are relevant to a given user query. As a result, these search engines suffer from several technical drawbacks such as handling scale, timeliness and reliability, in addition to ethical concerns such as commercial manipulation and information censorship. Alleviating the need to rely entirely on a single entity, Peer-to-Peer (P2P) Information Retrieval (IR) has been proposed as a solution, as it distributes the functional components of a web search engine – from crawling and indexing documents, to query processing – across the network of users (or, peers) who use the search engine. This strategy for constructing an IR system poses several efficiency and effectiveness challenges which have been identified in past work. Accordingly, this thesis makes several contributions towards advancing the state of the art in P2P-IR effectiveness by improving the query processing and relevance scoring aspects of a P2P web search. Federated search systems are a form of distributed information retrieval model that route the user’s information need, formulated as a query, to distributed resources and merge the retrieved result lists into a final list. P2P-IR networks are one form of federated search in routing queries and merging result among participating peers. The query is propagated through disseminated nodes to hit the peers that are most likely to contain relevant documents, then the retrieved result lists are merged at different points along the path from the relevant peers to the query initializer (or namely, customer). However, query routing in P2P-IR networks is considered as one of the major challenges and critical part in P2P-IR networks; as the relevant peers might be lost in low-quality peer selection while executing the query routing, and inevitably lead to less effective retrieval results. This motivates this thesis to study and propose query routing techniques to improve retrieval quality in such networks. Cluster-based semi-structured P2P-IR networks exploit the cluster hypothesis to organise the peers into similar semantic clusters where each such semantic cluster is managed by super-peers. In this thesis, I construct three semi-structured P2P-IR models and examine their retrieval effectiveness. I also leverage the cluster centroids at the super-peer level as content representations gathered from cooperative peers to propose a query routing approach called Inverted PeerCluster Index (IPI) that simulates the conventional inverted index of the centralised corpus to organise the statistics of peers’ terms. The results show a competitive retrieval quality in comparison to baseline approaches. Furthermore, I study the applicability of using the conventional Information Retrieval models as peer selection approaches where each peer can be considered as a big document of documents. The experimental evaluation shows comparative and significant results and explains that document retrieval methods are very effective for peer selection that brings back the analogy between documents and peers. Additionally, Learning to Rank (LtR) algorithms are exploited to build a learned classifier for peer ranking at the super-peer level. The experiments show significant results with state-of-the-art resource selection methods and competitive results to corresponding classification-based approaches. Finally, I propose reputation-based query routing approaches that exploit the idea of providing feedback on a specific item in the social community networks and manage it for future decision-making. The system monitors users’ behaviours when they click or download documents from the final ranked list as implicit feedback and mines the given information to build a reputation-based data structure. The data structure is used to score peers and then rank them for query routing. I conduct a set of experiments to cover various scenarios including noisy feedback information (i.e, providing positive feedback on non-relevant documents) to examine the robustness of reputation-based approaches. The empirical evaluation shows significant results in almost all measurement metrics with approximate improvement more than 56% compared to baseline approaches. Thus, based on the results, if one were to choose one technique, reputation-based approaches are clearly the natural choices which also can be deployed on any P2P network.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This paper presents a study made in a field poorly explored in the Portuguese language – modality and its automatic tagging. Our main goal was to find a set of attributes for the creation of automatic tag- gers with improved performance over the bag-of-words (bow) approach. The performance was measured using precision, recall and F1. Because it is a relatively unexplored field, the study covers the creation of the corpus (composed by eleven verbs), the use of a parser to extract syntac- tic and semantic information from the sentences and a machine learning approach to identify modality values. Based on three different sets of attributes – from trigger itself and the trigger’s path (from the parse tree) and context – the system creates a tagger for each verb achiev- ing (in almost every verb) an improvement in F1 when compared to the traditional bow approach.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Urbanization has grown during the last decades, with an increase in population concentrated in cities. Cities are usually relatively nature-poor, and the loss of green urban space likely leads to less contact with the natural world for urban dwellers. It is known that the natural environment could provide important advantages, and the loss of contact with this type of environment has potential negative impacts on the quality of life. The use of green urban space demonstrated stronger benefits for mental health and stress reduction. In general, exposure to green urban space is linked to a reduction in mortality rates, due to the promotion of a healthy lifestyle. Green urban space could be an optimal environment in which to perform physical activity. Undertaking regular physical activity is one of the major determinants of health. The benefits of exercise have been widely demonstrated through a wide range of studies. Benefits are linked to the treatment and prevention of most chronic and non-communicable diseases, that are not contagious, but they are usually long-lasting. Regular physical activity could reduce mental health problems, such as anxiety. The World Health Organization proposed to improve physical activity programs through the implementation of interventions in green urban spaces. Green urban space provides a safe, accessible, and attractive place to perform physical activity. All the interventions aimed to promote the practice of physical activity and to reduce sedentary behavior are important. It is well known that physical activity has several positive effects, a great amount of the population remains inactive. A good strategy could be to show people how integrated physical activity into their all-day life, for example through the use of green urban space or active commuting. The results in the present thesis showed the effectiveness of performing physical activity in a natural environment and of active commuting.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Artificial Intelligence is reshaping the field of fashion industry in different ways. E-commerce retailers exploit their data through AI to enhance their search engines, make outfit suggestions and forecast the success of a specific fashion product. However, it is a challenging endeavour as the data they possess is huge, complex and multi-modal. The most common way to search for fashion products online is by matching keywords with phrases in the product's description which are often cluttered, inadequate and differ across collections and sellers. A customer may also browse an online store's taxonomy, although this is time-consuming and doesn't guarantee relevant items. With the advent of Deep Learning architectures, particularly Vision-Language models, ad-hoc solutions have been proposed to model both the product image and description to solve this problems. However, the suggested solutions do not exploit effectively the semantic or syntactic information of these modalities, and the unique qualities and relations of clothing items. In this work of thesis, a novel approach is proposed to address this issues, which aims to model and process images and text descriptions as graphs in order to exploit the relations inside and between each modality and employs specific techniques to extract syntactic and semantic information. The results obtained show promising performances on different tasks when compared to the present state-of-the-art deep learning architectures.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

PURPOSE: To determine the association between language and number of citations of ophthalmology articles published in Brazilian journals. METHODS: This study was a systematic review. Original articles were identified by review of documents published at the two Brazilian ophthalmology journals indexed at Science Citation Index Expanded - SCIE [Arquivos Brasileiros de Oftalmologia (ABO) and Revista Brasileira de Oftalmologia (RBO)]. All document types (articles and reviews) listed at SCIE in English (English Group) or in Portuguese (Portuguese Group) from January 1, 2008 to December 31, 2009 were included, except: editorial materials; corrections; letters; and biographical items. The primary outcome was the number of citations through the end of second year after publication date. Subgroup analysis included likelihood of citation (cited at least once versus no citation), journal, and year of publication. RESULTS: The search at the web of science revealed 382 articles [107 (28%) in the English Group and 275 (72%) in the Portuguese Group]. Of those, 297 (77.7%) were published at the ABO and 85 (23.3%) at the RBO. The citation counts were statistically significantly higher (P<0.001) in the English Group (1.51 - SD 1.98 - range 0 to 11) compared with the Portuguese Group (0.57 - SD 1.06 - range 0 to 7). The likelihood citation was statistically significant higher (P<0.001) in the English Group (70/107 - 65.4%) compared with the Portuguese Group (89/275 - 32.7%). There were more articles published in English at the ABO (98/297 - 32.9%) than at the RBO (9/85 - 10.6%) [P<0.001]. There were no significant difference (P=0.967) at the proportion of articles published in English at the years 2008 (48/172 - 27.9%) and 2009 (59/210 - 28.1%). CONCLUSION: The number of citations of articles published in Portuguese at Brazilian ophthalmology journals is lower than the published in English. The results of this study suggest that the editorial boards should strongly encourage the authors to adopt English as the main language in their future articles.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

TEMA: a produção da fala nas modalidades de reabilitação oral protética. OBJETIVO: verificar se o tipo de reabilitação oral interfere na produção da fala. MÉTODO: 36 idosos (média = 68 anos), divididos em 3 grupos, foram avaliados: 13 com dentes naturais (A), 13 com prótese total mucosossuportada superior e inferior (B) e 10 com prótese total mucosossuportada superior e implantossuportada inferior (C). A estabilidade das próteses foi avaliada por um dentista e amostras de fala foram analisadas por 5 fonoaudiólogos. Para determinar a freqüência de alteração dos sons da fala utilizou-se o cálculo da Porcentagem de Consoantes Corretas (PCC). RESULTADOS: observou-se poucos casos com alteração de fala, com maior freqüência no grupo C (23,08%), sendo a articulação travada presente em todos os grupos, a redução dos movimentos labiais em dois grupos (A e B) e a articulação exagerada e a falta de controle salivar em um dos grupos (C e B). Quanto à PCC, menor valor foi observado para os fones linguodentais nos grupos B e C (maior ocorrência de alteração), seguido dos fones alveolares, predominando casos sem alteração no grupo A, contrariamente aos demais grupos, sendo a projeção lingual e o ceceio as alterações mais encontradas. Não houve diferença entre os grupos e a maioria do grupo B estava com a prótese inferior insatisfatória, não havendo associação entre alteração de fala e prótese insatisfatória. CONCLUSÃO: apesar da amostra pequena, indivíduos reabilitados com prótese total apresentam alteração nos fones linguodentais e alveolares e o tipo de prótese, bem como a estabilidade desta parece não interferir na produção da fala.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The objective of this study is to describe preliminary results from the cross-cultural adaptation of the Quality of Life Assessment Questionnaire, used to measure health related quality of life (HRQL) in Brazilian children aged between 5 and 11 with HIV/AIDS. The cross-cultural model evaluated the Concept, Item, Semantic and Measurement Equivalences (internal consistency and intra-observer reliability). Evaluation of the conceptual, item, semantic equivalences showed that the Portuguese version is pertinent for the Brazilian context. Four of seven domains showed internal consistency above 0.70 (α: 0.76-0.90) and five of seven revealed intra-observer reliability (ricc: 0.41-0.70). This first Portuguese version of the HRQL questionnaire can be understood as a valuable tool for assessing children's HRQL, but further studies with large samples and more robust analyses are recommended before use in the Brazilian context.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This work is part of a research under construction since 2000, in which the main objective is to measure small dynamic displacements by using L1 GPS receivers. A very sensible way to detect millimetric periodic displacements is based on the Phase Residual Method (PRM). This method is based on the frequency domain analysis of the phase residuals resulted from the L1 double difference static data processing of two satellites in almost orthogonal elevation angle. In this article, it is proposed to obtain the phase residuals directly from the raw phase observable collected in a short baseline during a limited time span, in lieu of obtaining the residual data file from regular GPS processing programs which not always allow the choice of the aimed satellites. In order to improve the ability to detect millimetric oscillations, two filtering techniques are introduced. One is auto-correlation which reduces the phase noise with random time behavior. The other is the running mean to separate low frequency from the high frequency phase sources. Two trials have been carried out to verify the proposed method and filtering techniques. One simulates a 2.5 millimeter vertical antenna displacement and the second uses the GPS data collected during a bridge load test. The results have shown a good consistency to detect millimetric oscillations.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Due to both the widespread and multipurpose use of document images and the current availability of a high number of document images repositories, robust information retrieval mechanisms and systems have been increasingly demanded. This paper presents an approach to support the automatic generation of relationships among document images by exploiting Latent Semantic Indexing (LSI) and Optical Character Recognition (OCR). We developed the LinkDI (Linking of Document Images) service, which extracts and indexes document images content, computes its latent semantics, and defines relationships among images as hyperlinks. LinkDI was experimented with document images repositories, and its performance was evaluated by comparing the quality of the relationships created among textual documents as well as among their respective document images. Considering those same document images, we ran further experiments in order to compare the performance of LinkDI when it exploits or not the LSI technique. Experimental results showed that LSI can mitigate the effects of usual OCR misrecognition, which reinforces the feasibility of LinkDI relating OCR output with high degradation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: We aimed to compare plasma concentrations of carbon dioxide (CO(2)) in dogs that underwent intra- and preperitoneal CO(2) insufflation. Materials and Methods: Thirty dogs were studied. Ten formed a control group, 10 underwent intraperitoneal CO(2) insufflation, and 10 underwent preperitoneal CO(2) insufflation. General anesthesia with controlled ventilation was standardized for all dogs. After stabilizing the anesthesia, blood samples were collected at predetermined times and were sent for immediate gasometric analysis. Analysis of variance was used for comparing variables. Results: The plasma CO(2) concentration in the intraperitoneal insufflation group increased significantly more than in the preperitoneal insufflation group and was significantly greater than in the control group (P < 0.05). The pH values in the intraperitoneal group were lower than in the preperitoneal group (P < 0.05). Conclusion: The data from this study suggest that a greater plasma concentration of CO(2) is achieved by insufflation at constant pressure into the intraperitoneal space than into the preperitoneal space.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We consider the problem of interaction neighborhood estimation from the partial observation of a finite number of realizations of a random field. We introduce a model selection rule to choose estimators of conditional probabilities among natural candidates. Our main result is an oracle inequality satisfied by the resulting estimator. We use then this selection rule in a two-step procedure to evaluate the interacting neighborhoods. The selection rule selects a small prior set of possible interacting points and a cutting step remove from this prior set the irrelevant points. We also prove that the Ising models satisfy the assumptions of the main theorems, without restrictions on the temperature, on the structure of the interacting graph or on the range of the interactions. It provides therefore a large class of applications for our results. We give a computationally efficient procedure in these models. We finally show the practical efficiency of our approach in a simulation study.