183 resultados para rich text format
Resumo:
Description of a patient's injuries is recorded in narrative text form by hospital emergency departments. For statistical reporting, this text data needs to be mapped to pre-defined codes. Existing research in this field uses the Naïve Bayes probabilistic method to build classifiers for mapping. In this paper, we focus on providing guidance on the selection of a classification method. We build a number of classifiers belonging to different classification families such as decision tree, probabilistic, neural networks, and instance-based, ensemble-based and kernel-based linear classifiers. An extensive pre-processing is carried out to ensure the quality of data and, in hence, the quality classification outcome. The records with a null entry in injury description are removed. The misspelling correction process is carried out by finding and replacing the misspelt word with a soundlike word. Meaningful phrases have been identified and kept, instead of removing the part of phrase as a stop word. The abbreviations appearing in many forms of entry are manually identified and only one form of abbreviations is used. Clustering is utilised to discriminate between non-frequent and frequent terms. This process reduced the number of text features dramatically from about 28,000 to 5000. The medical narrative text injury dataset, under consideration, is composed of many short documents. The data can be characterized as high-dimensional and sparse, i.e., few features are irrelevant but features are correlated with one another. Therefore, Matrix factorization techniques such as Singular Value Decomposition (SVD) and Non Negative Matrix Factorization (NNMF) have been used to map the processed feature space to a lower-dimensional feature space. Classifiers with these reduced feature space have been built. In experiments, a set of tests are conducted to reflect which classification method is best for the medical text classification. The Non Negative Matrix Factorization with Support Vector Machine method can achieve 93% precision which is higher than all the tested traditional classifiers. We also found that TF/IDF weighting which works well for long text classification is inferior to binary weighting in short document classification. Another finding is that the Top-n terms should be removed in consultation with medical experts, as it affects the classification performance.
Resumo:
This paper evaluates the performance of different text recognition techniques for a mobile robot in an indoor (university campus) environment. We compared four different methods: our own approach using existing text detection methods (Minimally Stable Extremal Regions detector and Stroke Width Transform) combined with a convolutional neural network, two modes of the open source program Tesseract, and the experimental mobile app Google Goggles. The results show that a convolutional neural network combined with the Stroke Width Transform gives the best performance in correctly matched text on images with single characters whereas Google Goggles gives the best performance on images with multiple words. The dataset used for this work is released as well.
Resumo:
Coral reefs provide an increasingly important archive of palaeoclimate data that can be used to constrain climate model simulations. Reconstructing past environmental conditions may also provide insights into the potential of reef systems to survive changes in the Earth’s climate. Reef-based palaeoclimate reconstructions are predominately derived from colonies of massive Porites, with the most abundant genus in the Indo-Pacific—Acropora—receiving little attention owing to their branching growth trajectories, high extension rates and secondary skeletal thickening. However, inter-branch skeleton (consisting of both coenosteum and corallites) near the bases of corymbose Acropora colonies holds significant potential as a climate archive. This region of Acropora skeleton is atypical, having simple growth trajectories with parallel corallites, approximately horizontal density banding, low apparent extension rates and a simple microstructure with limited secondary thickening. Hence, inter-branch skeleton in Acropora bears more similarities to the coralla of massive corals, such as Porites, than to traditional Acropora branches. Cyclic patterns of Sr/Ca ratios in this structure suggest that the observed density banding is annual in nature, thus opening up the potential to use abundant corymbose Acropora for palaeoclimate reconstruction.
Resumo:
This project is a step forward in the study of text mining where enhanced text representation with semantic information plays a significant role. It develops effective methods of entity-oriented retrieval, semantic relation identification and text clustering utilizing semantically annotated data. These methods are based on enriched text representation generated by introducing semantic information extracted from Wikipedia into the input text data. The proposed methods are evaluated against several start-of-art benchmarking methods on real-life data-sets. In particular, this thesis improves the performance of entity-oriented retrieval, identifies different lexical forms for an entity relation and handles clustering documents with multiple feature spaces.
Resumo:
Objective. To test the impact of a theory-based, SMS (text message)-delivered behavioural intervention (Healthy Text) targeting sun protection or skin self-examination behaviours compared to attention-control. Method. Overall, 546 participants aged 18–42 years were randomised using a computer-generated number list to the skin self-examination (N = 176), sun protection (N = 187), or attention-control (N = 183) text messages group. Each group received 21 text messages about their assigned topic over 12 months (12 weekly messages for three months, then monthly messages for the next nine months). Data was collected via telephone survey at baseline, three-, and 12-months across Queensland from January 2012 to August 2013. Results. One year after baseline, the sun protection (mean change 0.12; P = 0.030) and skin self-examination groups (mean change 0.12; P = 0.035) had significantly greater improvement in their sun protection habits (SPH) index compared to the attention-control group (reference mean change 0.02). The increase in the proportion of participants who reported any skin self-examination from baseline to 12 months was significantly greater in the skin self-examination intervention group (103/163; 63%; P < 0.001) than the sun protection (83/173; 48%), or attention-control (65/165; 36%) groups. There was no significant effect of the intervention for participants who self-reported whole-body skin self-examination, sun tanning behaviour, or sunburn behaviours. Conclusion. The Healthy Text intervention was effective in inducing significant improvements in sun protection and any type of skin self-examination behaviours.
Resumo:
The Dulux Study Tour is a collaborative initiative between Dulux, the Australian Institute of Architects and EmAGN. Each year five (5) architectural professionals (within 10 years of graduation) are selected to join the Dulux Study Tour, an international tour visiting leading architectural firms, recently completed projects and architecturally rich locations. The Dulux Study Tour acknowledges the contribution the selected emerging architects make to practice, research and the culture of architecture, and seeks to further inspire the next generation of emerging architects.
Resumo:
Background Rapid diagnostic tests (RDTs) for detection of Plasmodium falciparum infection that target P. falciparum histidine-rich protein 2 (PfHRP2), a protein that circulates in the blood of patients infected with this species of malaria, are widely used to guide case management. Understanding determinants of PfHRP2 availability in circulation is therefore essential to understanding the performance of PfHRP2-detecting RDTs. Methods The possibility that pre-formed host anti-PfHRP2 antibodies may block target antigen detection, thereby causing false negative test results was investigated in this study. Results Anti-PfHRP2 antibodies were detected in 19/75 (25%) of plasma samples collected from patients with acute malaria from Cambodia, Nigeria and the Philippines, as well as in 3/28 (10.7%) asymptomatic Solomon Islands residents. Pre-incubation of plasma samples from subjects with high-titre anti-PfHRP2 antibodies with soluble PfHRP2 blocked the detection of the target antigen on two of the three brands of RDTs tested, leading to false negative results. Pre-incubation of the plasma with intact parasitized erythrocytes resulted in a reduction of band intensity at the highest parasite density, and a reduction of lower detection threshold by ten-fold on all three brands of RDTs tested. Conclusions These observations indicate possible reduced sensitivity for diagnosis of P. falciparum malaria using PfHRP2-detecting RDTs among people with high levels of specific antibodies and low density infection, as well as possible interference with tests configured to detect soluble PfHRP2 in saliva or urine samples. Further investigations are required to assess the impact of pre-formed anti-PfHRP2 antibodies on RDT performance in different transmission settings.
Resumo:
Background: A major challenge for assessing students’ conceptual understanding of STEM subjects is the capacity of assessment tools to reliably and robustly evaluate student thinking and reasoning. Multiple-choice tests are typically used to assess student learning and are designed to include distractors that can indicate students’ incomplete understanding of a topic or concept based on which distractor the student selects. However, these tests fail to provide the critical information uncovering the how and why of students’ reasoning for their multiple-choice selections. Open-ended or structured response questions are one method for capturing higher level thinking, but are often costly in terms of time and attention to properly assess student responses. Purpose: The goal of this study is to evaluate methods for automatically assessing open-ended responses, e.g. students’ written explanations and reasoning for multiple-choice selections. Design/Method: We incorporated an open response component for an online signals and systems multiple-choice test to capture written explanations of students’ selections. The effectiveness of an automated approach for identifying and assessing student conceptual understanding was evaluated by comparing results of lexical analysis software packages (Leximancer and NVivo) to expert human analysis of student responses. In order to understand and delineate the process for effectively analysing text provided by students, the researchers evaluated strengths and weakness for both the human and automated approaches. Results: Human and automated analyses revealed both correct and incorrect associations for certain conceptual areas. For some questions, that were not anticipated or included in the distractor selections, showing how multiple-choice questions alone fail to capture the comprehensive picture of student understanding. The comparison of textual analysis methods revealed the capability of automated lexical analysis software to assist in the identification of concepts and their relationships for large textual data sets. We also identified several challenges to using automated analysis as well as the manual and computer-assisted analysis. Conclusions: This study highlighted the usefulness incorporating and analysing students’ reasoning or explanations in understanding how students think about certain conceptual ideas. The ultimate value of automating the evaluation of written explanations is that it can be applied more frequently and at various stages of instruction to formatively evaluate conceptual understanding and engage students in reflective
Resumo:
The controlled synthesis of nanostructured materials remains an ongoing area of research, especially as the size, shape and composition of nanomaterials can greatly influence their properties and applications. In this work we present the electrodeposition of highly dendritic platinum rich platinum-lead nanostructures, where lead acetate acts as an inorganic shape directing agent via underpotential deposition on the growing electrodeposit. It was found that these nanomaterials readily oxidise at potentials below monolayer oxide formation, which significantly impacts on the methanol electrooxidation reaction and correlates with the incipient hydrous oxide adatom mediator (IHOAM) model of electrocatalysis. Additionally these materials were tested for their surface enhanced Raman scattering (SERS) activity, where the high density of sharp tips provides promise for their application as SERS substrates.
Resumo:
Evidence is needed for the acceptability and user preferences of receiving skin cancer-related text messages. We prepared 27 questions to evaluate attitudes, satisfaction with program characteristics such as timing and spacing, and overall satisfaction with the Healthy Text program in young adults. Within this randomised controlled trial (age 18-42 years), 546 participants were assigned to one of three Healthy Text message groups; sun protection, skin self-examination, or attention-control. Over a 12-month period, 21 behaviour-specific text messages were sent to each group. Participants’ preferences were compared between the two interventions and control group at the 12-month follow-up telephone interview. In all three groups, participants reported the messages were easy to understand (98%), provided good suggestions or ideas (88%), and were encouraging (86%) and informative (85%) with little difference between the groups. The timing of the texts was received positively (92%); however, some suggestions for frequency or time of day the messages were received from 8% of participants. Participants in the two intervention groups found their messages more informative, and triggering behaviour change compared to control. Text messages about skin cancer prevention and early detection are novel and acceptable to induce behaviour change in young adults.
Resumo:
It is a big challenge to guarantee the quality of discovered relevance features in text documents for describing user preferences because of large scale terms and data patterns. Most existing popular text mining and classification methods have adopted term-based approaches. However, they have all suffered from the problems of polysemy and synonymy. Over the years, there has been often held the hypothesis that pattern-based methods should perform better than term-based ones in describing user preferences; yet, how to effectively use large scale patterns remains a hard problem in text mining. To make a breakthrough in this challenging issue, this paper presents an innovative model for relevance feature discovery. It discovers both positive and negative patterns in text documents as higher level features and deploys them over low-level features (terms). It also classifies terms into categories and updates term weights based on their specificity and their distributions in patterns. Substantial experiments using this model on RCV1, TREC topics and Reuters-21578 show that the proposed model significantly outperforms both the state-of-the-art term-based methods and the pattern based methods.
Resumo:
This chapter presents data produced by a research project that looked at pedagogy for print and digital literacies in a high poverty, high diversity primary school. The student population included refugee, immigrant and Aboriginal and Torres Strait Islander young people. In an environment in which schools, such as the study site, are under pressure to narrow the curriculum to ‘the basics’, the project sought to support teachers as they worked to create a rich curriculum for all students. The chapter will focus on pedagogy in an after-school media club. The aim of the club, which ran weekly for several years, was to build students’ media literacy skills. The data suggest that established ways of scaffolding linguistic texts cannot be simply transferred to multimodal text production. The chapter will also address implications from the research outcomes for other teachers working with At Risk EAL students.