894 resultados para Text categorization
Resumo:
This essay explores the ways in which the performance of Jewish identity (in the sense both of representing Jewish characters and of writing about those characters’ conscious and unconscious renditions of their Jewishness) is a particular concern (in both senses of the word) for Lorrie Moore. Tracing Moore's representations of Jewishness over the course of her career, from the early story “The Jewish Hunter” through to her most recent novel, A Gate at the Stairs, I argue that it is characterized by (borrowing a phrase from Moore herself) “performance anxiety,” an anxiety that manifests itself in awkward comedy and that can be read both in biographical terms and as an oblique commentary on, or reworking of, the passing narrative, which I call “anti-passing.” Just as passing narratives complicate conventional ethno-racial definitions so Moore's anti-passing narratives, by representing Jews who represent themselves as other to themselves, as well as to WASP America, destabilize the category of Jewishness and, by implication, deconstruct the very notion of ethnic categorization.
Resumo:
Aircraft Maintenance, Repair and Overhaul (MRO) feedback commonly includes an engineer’s complex text-based inspection report. Capturing and normalizing the content of these textual descriptions is vital to cost and quality benchmarking, and provides information to facilitate continuous improvement of MRO process and analytics. As data analysis and mining tools requires highly normalized data, raw textual data is inadequate. This paper offers a textual-mining solution to efficiently analyse bulk textual feedback data. Despite replacement of the same parts and/or sub-parts, the actual service cost for the same repair is often distinctly different from similar previously jobs. Regular expression algorithms were incorporated with an aircraft MRO glossary dictionary in order to help provide additional information concerning the reason for cost variation. Professional terms and conventions were included within the dictionary to avoid ambiguity and improve the outcome of the result. Testing results show that most descriptive inspection reports can be appropriately interpreted, allowing extraction of highly normalized data. This additional normalized data strongly supports data analysis and data mining, whilst also increasing the accuracy of future quotation costing. This solution has been effectively used by a large aircraft MRO agency with positive results.
Resumo:
The performance of Samuel Daniel's masque The Vision of the Twelve Goddesses at court on January 8, 1604 took place in the midst of the preliminary negotiations that would lead to the signing of the Anglo-Spanish peace at Somerset House the following August. Philip III sent a special ambassador to England to congratulate James on his accession, and a series of tussles between Juan de Tassis and his French counterpart ensued. As a recently-discovered document in the Archivo General de Simancas reveals, Anna of Denmark intervened personally to insure that de Tassis, and not the Frenchman, attended the masque. This was a clear signal of James and Anna's peace aims, which de Tassis conveyed to the King of Spain; moreover, he enclosed in his dispatch a text of Daniel's masque which he clearly considered both political intelligence and of interest to the theater-loving Hapsburg monarch. The Simancas text of the Daniel masque is a new version, hitherto unknown, which adds to our knowledge of the circumstances in which the first Stuart masque was performed. Here we present a transcription and annotated translation of both de Tassis' letter and the text of the masque he had compiled for Philip III. (B. C.-E. and M. H.)
Resumo:
The recent increase in short messaging system (SMS) text messaging, often using abbreviated, non-conventional ‘textisms’ (e.g. ‘2nite’), in school-aged children has raised fears of negative consequences of such technology for literacy. The current research used a paradigm developed by Dixon and Kaminska, who showed that exposure to phonetically plausible misspellings (e.g. ‘recieve’) negatively affected subsequent spelling performance, though this was true only with adults, not children. The current research extends this work to directly investigate the effects of exposure to textisms, misspellings and correctly spelledwords on adults’ spelling. Spelling of a set of key words was assessed both before and after an exposure phase where participants read the same key words, presented either as textisms (e.g. ‘2nite’), correctly spelled (e.g. ‘tonight’) or misspelled (e.g. 'tonite’)words. Analysis showed that scores decreased from pre- to post-test following exposure to misspellings, whereas performance improved following exposure to correctly spelled words and, interestingly, to textisms. Data suggest that exposure to textisms, unlike misspellings, had a positive effect on adults’ spelling. These findings are interpreted in light of other recent research suggesting a positive relationship between texting and some literacy measures in school-aged children.
Resumo:
Typeface design: a series of collaborative projects commissioned by Adobe, Inc. and Brill to develop extensive polytonic Greek typefaces. The two Adobe typefaces can be seen as extension of previous research for the Garamond Premier Pro family (2005), and concludes a research theme started in 1998 with work for Adobe’s Minion Pro Greek. These typefaces together define the state of the art for text-intensive Greek typesetting for wide character set texts (from classical texts, to poetry, to essays, to prose). They serve both as exemplar for other developers, and as vehicles for developing the potential of Greek text typography, for example with the parallel inclusion of monotonic and polytonic characters, detailed localised punctuation options, fluid handling of case-conversion issues, and innovative options such as accented small caps (originally requested by bibliographers, and subsequently rolled out to a general user base). The Brill typeface (for the established academic publisher) has an exceptionally wide character set to cover several academic disciplines, and is intended to differentiate sufficiently from its partner Latin typeface, while maintaining a clear texture in both offset and low-resolution print-on-demand reproduction. This work involved substantial amounts of testing and modifying the design, especially of diacritics, to maintain clarity the readability of unfamiliar words. All together these typefaces form a study in how Greek typesetting meets contemporary typographic requirements, while resonating with historically accurate styles, where these are present. Significant research in printing archives helped to identify appropriate styles, as well as originate variants that are coherent stylistically, even when historical equivalents were absent.
Resumo:
Background: Since their inception, Twitter and related microblogging systems have provided a rich source of information for researchers and have attracted interest in their affordances and use. Since 2009 PubMed has included 123 journal articles on medicine and Twitter, but no overview exists as to how the field uses Twitter in research. // Objective: This paper aims to identify published work relating to Twitter indexed by PubMed, and then to classify it. This classification will provide a framework in which future researchers will be able to position their work, and to provide an understanding of the current reach of research using Twitter in medical disciplines. Limiting the study to papers indexed by PubMed ensures the work provides a reproducible benchmark. // Methods: Papers, indexed by PubMed, on Twitter and related topics were identified and reviewed. The papers were then qualitatively classified based on the paper’s title and abstract to determine their focus. The work that was Twitter focused was studied in detail to determine what data, if any, it was based on, and from this a categorization of the data set size used in the studies was developed. Using open coded content analysis additional important categories were also identified, relating to the primary methodology, domain and aspect. // Results: As of 2012, PubMed comprises more than 21 million citations from biomedical literature, and from these a corpus of 134 potentially Twitter related papers were identified, eleven of which were subsequently found not to be relevant. There were no papers prior to 2009 relating to microblogging, a term first used in 2006. Of the remaining 123 papers which mentioned Twitter, thirty were focussed on Twitter (the others referring to it tangentially). The early Twitter focussed papers introduced the topic and highlighted the potential, not carrying out any form of data analysis. The majority of published papers used analytic techniques to sort through thousands, if not millions, of individual tweets, often depending on automated tools to do so. Our analysis demonstrates that researchers are starting to use knowledge discovery methods and data mining techniques to understand vast quantities of tweets: the study of Twitter is becoming quantitative research. // Conclusions: This work is to the best of our knowledge the first overview study of medical related research based on Twitter and related microblogging. We have used five dimensions to categorise published medical related research on Twitter. This classification provides a framework within which researchers studying development and use of Twitter within medical related research, and those undertaking comparative studies of research relating to Twitter in the area of medicine and beyond, can position and ground their work.
Resumo:
This paper introduces a novel approach for free-text keystroke dynamics authentication which incorporates the use of the keyboard’s key-layout. The method extracts timing features from specific key-pairs. The Euclidean distance is then utilized to find the level of similarity between a user’s profile data and his/her test data. The results obtained from this method are reasonable for free-text authentication while maintaining the maximum level of user relaxation. Moreover, it has been proven in this study that flight time yields better authentication results when compared with dwell time. In particular, the results were obtained with only one training sample for the purpose of practicality and ease of real life application.