7 resultados para 280205 Text Processing
em QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast
Resumo:
This article takes as its main point of departure a body of empirical research on reading and text processing, and makes particular reference to the type of experiments conducted in Egidi and Gerrig (2006) and Rapp and Gerrig (2006). Broadly put, these experiments (i) explore the psychology of readers’ preferences for narrative outcomes, (ii) examine the way readers react to characters’ goals and actions, and (iii) investigate how readers tend to identify with characters’ goals the more ‘urgently’ those goals are narrated. The present article signals how stylistics can productively enrich such experimental work. Stylistics, it is argued, is well equipped to deal with subtle and nuanced variations in textual patterns without losing sight of the broader cognitive and discoursal positioning of readers in relation to these patterns. Making particular reference to what might constitute narrative ‘urgency’, the article develops a model which amalgamates different strands of contemporary research in narrative stylistics. This model advances and elaborates three key components: a Stylistic Profile, a Burlesque Block and a Kuleshov Monitor. Developing analyses of, and informal informant tests on, examples of both fiction and film, the article calls for a more rounded and sophisticated understanding of style in empirical research on subjects’ responses to patterns in narrative.
Resumo:
C17 polyacetylenes are a group of bioactive compounds present in carrots which have recently gained scientific attention due to their cytotoxicity against cancer cells. In common with many bioactive compounds, their levels may be influenced by thermal processes, such as boiling or water immersion. This study investigated the effect of a number of water immersion time/temperature combinations on concentrations of these compounds and attempted to model the changes. Carrot samples were thermally treated by heating in water at temperatures from 50–100 °C and holding times of 2–60 min. Following heating, levels of falcarinol (FaOH), falcarindiol (FaDOH), falcarindiol-3-acetate (FaDOAc) and Hunter colour parameters (L*, a*, b*) were determined. FaOH, FaDOH, FaDOAc levels were significantly reduced at lower temperatures (50–60 °C). In contrast, samples heated at temperatures from 70–100 °C exhibited higher levels of polyacetylenes (p < 0.05) than did raw unprocessed samples. Regression modelling was used to model the effects of temperature and holding time on the levels of the variables measured. Temperature treatment and holding time were found to significantly affect the polyacetylene content of carrot disks. Predicted models were found to be significant (p < 0.05) with high coefficients of determination (R2).
Resumo:
Many plain text information hiding techniques demand deep semantic processing, and so suffer in reliability. In contrast, syntactic processing is a more mature and reliable technology. Assuming a perfect parser, this paper evaluates a set of automated and reversible syntactic transforms that can hide information in plain text without changing the meaning or style of a document. A large representative collection of newspaper text is fed through a prototype system. In contrast to previous work, the output is subjected to human testing to verify that the text has not been significantly compromised by the information hiding procedure, yielding a success rate of 96% and bandwidth of 0.3 bits per sentence. © 2007 SPIE-IS&T.
Resumo:
We address the problem of mining interesting phrases from subsets of a text corpus where the subset is specified using a set of features such as keywords that form a query. Previous algorithms for the problem have proposed solutions that involve sifting through a phrase dictionary based index or a document-based index where the solution is linear in either the phrase dictionary size or the size of the document subset. We propose the usage of an independence assumption between query keywords given the top correlated phrases, wherein the pre-processing could be reduced to discovering phrases from among the top phrases per each feature in the query. We then outline an indexing mechanism where per-keyword phrase lists are stored either in disk or memory, so that popular aggregation algorithms such as No Random Access and Sort-merge Join may be adapted to do the scoring at real-time to identify the top interesting phrases. Though such an approach is expected to be approximate, we empirically illustrate that very high accuracies (of over 90%) are achieved against the results of exact algorithms. Due to the simplified list-aggregation, we are also able to provide response times that are orders of magnitude better than state-of-the-art algorithms. Interestingly, our disk-based approach outperforms the in-memory baselines by up to hundred times and sometimes more, confirming the superiority of the proposed method.
Resumo:
Massive amount of data that are geo-tagged and associated with text information are being generated at an unprecedented scale. These geo-textual data cover a wide range of topics. Users are interested in receiving up-to-date geo-textual objects (e.g., geo-tagged Tweets) such that their locations meet users’ need and their texts are interesting to users. For example, a user may want to be updated with tweets near her home on the topic “dengue fever headache.” In this demonstration, we present SOPS, the Spatial-Keyword Publish/Subscribe System, that is capable of efficiently processing spatial keyword continuous queries. SOPS supports two types of queries: (1) Boolean Range Continuous (BRC) query that can be used to subscribe the geo-textual objects satisfying a boolean keyword expression and falling in a specified spatial region; (2) Temporal Spatial-Keyword Top-k Continuous (TaSK) query that continuously maintains up-to-date top-k most relevant results over a stream of geo-textual objects. SOPS enables users to formulate their queries and view the real-time results over a stream of geotextual objects by browser-based user interfaces. On the server side, we propose solutions to efficiently processing a large number of BRC queries (tens of millions) and TaSK queries over a stream of geo-textual objects.