978 resultados para Short-text clustering
Resumo:
In the present study we focus on the interaction between the acquisition of new words and text organisation. In the acquisition of new words we emphasise the acquisition of paradigmatic relations such as hyponymy, meronymy and semantic sets. We work with a group of girls attending a private school for adolescents in serious difficulties. The subjects are from disadvantaged families. Their writing skills were very poor. When asked to describe a garden, they write a short text of a single paragraph, the lexical items were generic, there were no adjectives, and all of them use mainly existential verbs. The intervention plan assumed that subjects must to be exposed to new words, working out its meaning. In presence of referents subjects were taught new words making explicit the intended relation of the new term to a term already known. In the classroom subjects were asked to write all the words they knew drawing the relationships among them. They talk about the words specifying the relation making explicit pragmatic directions like is a kind of, is a part of or are all x. After that subjects were exposed to the task of choosing perspective. The work presented in this paper accounts for significant differences in the text of the subjects before and after the intervention. While working new words subjects were organising their lexicon and learning to present a whole entity in perspective.
Resumo:
Dissertação de Mestrado apresentado ao Instituto de Contabilidade e Administração do Porto para a obtenção do grau de Mestre em Marketing Digital sob orientação de Sandrina Teixeira Anabela Ribeiro
Resumo:
The objective of this work was to assess the time-related action of probiotic Lactobacillus plantarum in the bacterial microbiota of the digestive tract of Litopenaeus vannamei, and the relation of total haemocyte count and serum phenol oxidase activity of shrimp challenged with Vibrio harveyi. Shrimps were fed with a probiotic-supplemented diet, for eight days, then shifted to a commercial diet. Shrimps fed only with the commercial diet served as control. Evaluations were made on the 8th day of experiment and repeated two, four, six and eight days later. Total lactic bacteria in the digestive tract was higher until the 4th day of evaluation in the probiotic-supplemented group. Vibrio spp. counts were higher in the control at days zero and two. Until the 4th day of evaluation, the total haemocyte counts in shrimps after challenge with V. harveyi were higher in probiotic-supplemented group than in control group. Significant difference was not observed in phenol oxidase activity. On the 6th day after shifting from supplemented to control diet, all parameters were equal in both groups, suggesting that the time-related action of L. plantarum in shrimp is short.
Resumo:
Interpretation of utterances affects an interrogator’s determination of human from machine during live Turing tests. Here, we consider transcripts realised as a result of a series of practical Turing tests that were held on 23 June 2012 at Bletchley Park, England. The focus in this paper is to consider the effects of lying and truth-telling on the human judges by the hidden entities, whether human or a machine. Turing test transcripts provide a glimpse into short text communication, the type that occurs in emails: how does the reader determine truth from the content of a stranger’s textual message? Different types of lying in the conversations are explored, and the judge’s attribution of human or machine is investigated in each test.
Resumo:
The extended flight of the Airborne Ionospheric Observatory during the Geospace Environment Modeling (GEM) Pilot program on January 16, 1990, allowed continuous all-sky monitoring of the two-dimensional ionospheric footprint of the northward interplanetary magnetic field (IMF) cusp in several wavelengths. Especially important in determining the locus of magnetosheath electron precipitation was the 630.0-nm red line emission. The most striking morphological change in the images was the transient appearance of zonally elongated regions of enhanced 630.0-nm emission which resembled “rays” emanating from the centroid of the precipitation. The appearance of these rays was strongly correlated with the Y component of the IMF: when the magnitude of By was large compared to Bz, the rays appeared; otherwise, the distribution was relatively unstructured. Late in the flight the field of view of the imager included the field of view of flow measurements from the European incoherent scatter radar (EISCAT). The rays visible in 630.0-nm emission exactly aligned with the position of strong flow jets observed by EISCAT. We attribute this correspondence to the requirement of quasi-neutrality; namely, the soft electrons have their largest precipitating fluxes where the bulk of the ions precipitate. The ions, in regions of strong convective flow, are spread out farther along the flow path than in regions of weaker flow. The occurrence and direction of these flow bursts are controlled by the IMF in a manner consistent with newly opened flux tubes; i.e., when |By| > |Bz|, tension in the reconnected field lines produce east-west flow regions downstream of the ionospheric projection of the x line. We interpret the optical rays (flow bursts), which typically last between 5 and 15 min, as evidence of periods of enhanced dayside (or lobe) reconnection when |By| > |Bz|. The length of the reconnection pulse is difficult to determine, however, since strong zonal flows would be expected to persist until the tension force in the field line has decayed, even if the duration of the enhanced reconnection was relatively short.
Resumo:
La presente Tesi di Dottorato intende affrontare una lettura critica della Casa in Belvederestraße 60, realizzata dall’architetto Oswald Mathias Ungers (Kaisersesch, 12 luglio 1926 – Köln, 30 settembre 2007), nel 1958-’59 a Köln-Müngersdorf, come studio per sé ed abitazione per la propria famiglia. Questo primo oggetto della ricerca viene considerato evidente espressione delle convinzioni formali e compositive dell’architetto, negli anni Cinquanta e Sessanta. A differenza di altri progetti residenziali coevi ed antecedenti, frutto di un’elaborazione autonoma, la prima casa che costruisce per sé riflette una maggiore libertà di pensiero, dettata dalla coincidenza delle figure di progettista e committente; a ciò si aggiunge anche una precisa volontà dichiarativa ed ideologica. Proprio quest’ultimo aspetto permette di introdurre il secondo oggetto della Tesi: il manifesto “ideologico”, Zu einer neuen Architektur, scritto dallo stesso Oswald Mathias Ungers e da Reinhard Gieselmann, alla fine del 1960; un breve testo che espone, con toni perentori ed inappellabili, il punto di vista dei due architetti nei confronti di un panorama architettonico e critico, caratterizzato da una sterilità di pensiero dilagante, a causa dell’egemonia costruttiva funzionalista. La ricerca indaga quindi le forti reciprocità delle due opere: casa e testo, viste in chiave di “manifesto scritto e manifesto costruito”. Il primo legame tra i due soggetti è senza dubbio la concomitanza temporale, (tra il 1958 ed il 1960) associata ad un rapporto causa-effetto, tale per cui il manifesto viene redatto a difesa delle aspre critiche scaturite dalla pubblicazione della casa sulla rivista Bauwelt. Il secondo nesso è la possibilità di comprendere le accezioni effettive dei termini impiegati nella redazione del testo, attraverso le forme di una delle opere maggiormente personali dell’architetto, estraendone il senso e conferendogli un’immagine architettonica. Si vuole creare così un rapporto biunivoco di traducibilità, dell’architettura nello scritto e della semantica ungersiana in azioni compositive.
Resumo:
According to the colophon (f. 117v), copy completed in the hand of ʻAbd al-Razzāq ibn Muḥammad Ḥusayn al-Yazdī in 1240 AH [December 1824-5 AD].
Resumo:
As microblog services such as Twitter become a fast and convenient communication approach, identification of trendy topics in microblog services has great academic and business value. However detecting trendy topics is very challenging due to huge number of users and short-text posts in microblog diffusion networks. In this paper we introduce a trendy topics detection system under computation and communication resource constraints. In stark contrast to retrieving and processing the whole microblog contents, we develop an idea of selecting a small set of microblog users and processing their posts to achieve an overall acceptable trendy topic coverage, without exceeding resource budget for detection. We formulate the selection operation of these subset users as mixed-integer optimization problems, and develop heuristic algorithms to compute their approximate solutions. The proposed system is evaluated with real-time test data retrieved from Sina Weibo, the dominant microblog service provider in China. It's shown that by monitoring 500 out of 1.6 million microblog users and tracking their microposts (about 15,000 daily) with our system, nearly 65% trendy topics can be detected, while on average 5 hours earlier before they appear in Sina Weibo official trends.
Resumo:
Short text messages a.k.a Microposts (e.g. Tweets) have proven to be an effective channel for revealing information about trends and events, ranging from those related to Disaster (e.g. hurricane Sandy) to those related to Violence (e.g. Egyptian revolution). Being informed about such events as they occur could be extremely important to authorities and emergency professionals by allowing such parties to immediately respond. In this work we study the problem of topic classification (TC) of Microposts, which aims to automatically classify short messages based on the subject(s) discussed in them. The accurate TC of Microposts however is a challenging task since the limited number of tokens in a post often implies a lack of sufficient contextual information. In order to provide contextual information to Microposts, we present and evaluate several graph structures surrounding concepts present in linked knowledge sources (KSs). Traditional TC techniques enrich the content of Microposts with features extracted only from the Microposts content. In contrast our approach relies on the generation of different weighted semantic meta-graphs extracted from linked KSs. We introduce a new semantic graph, called category meta-graph. This novel meta-graph provides a more fine grained categorisation of concepts providing a set of novel semantic features. Our findings show that such category meta-graph features effectively improve the performance of a topic classifier of Microposts. Furthermore our goal is also to understand which semantic feature contributes to the performance of a topic classifier. For this reason we propose an approach for automatic estimation of accuracy loss of a topic classifier on new, unseen Microposts. We introduce and evaluate novel topic similarity measures, which capture the similarity between the KS documents and Microposts at a conceptual level, considering the enriched representation of these documents. Extensive evaluation in the context of Emergency Response (ER) and Violence Detection (VD) revealed that our approach outperforms previous approaches using single KS without linked data and Twitter data only up to 31.4% in terms of F1 measure. Our main findings indicate that the new category graph contains useful information for TC and achieves comparable results to previously used semantic graphs. Furthermore our results also indicate that the accuracy of a topic classifier can be accurately predicted using the enhanced text representation, outperforming previous approaches considering content-based similarity measures. © 2014 Elsevier B.V. All rights reserved.
Resumo:
Topic classification (TC) of short text messages offers an effective and fast way to reveal events happening around the world ranging from those related to Disaster (e.g. Sandy hurricane) to those related to Violence (e.g. Egypt revolution). Previous approaches to TC have mostly focused on exploiting individual knowledge sources (KS) (e.g. DBpedia or Freebase) without considering the graph structures that surround concepts present in KSs when detecting the topics of Tweets. In this paper we introduce a novel approach for harnessing such graph structures from multiple linked KSs, by: (i) building a conceptual representation of the KSs, (ii) leveraging contextual information about concepts by exploiting semantic concept graphs, and (iii) providing a principled way for the combination of KSs. Experiments evaluating our TC classifier in the context of Violence detection (VD) and Emergency Responses (ER) show promising results that significantly outperform various baseline models including an approach using a single KS without linked data and an approach using only Tweets. Copyright 2013 ACM.
Resumo:
With the development of information technology, the theory and methodology of complex network has been introduced to the language research, which transforms the system of language in a complex networks composed of nodes and edges for the quantitative analysis about the language structure. The development of dependency grammar provides theoretical support for the construction of a treebank corpus, making possible a statistic analysis of complex networks. This paper introduces the theory and methodology of the complex network and builds dependency syntactic networks based on the treebank of speeches from the EEE-4 oral test. According to the analysis of the overall characteristics of the networks, including the number of edges, the number of the nodes, the average degree, the average path length, the network centrality and the degree distribution, it aims to find in the networks potential difference and similarity between various grades of speaking performance. Through clustering analysis, this research intends to prove the network parameters’ discriminating feature and provide potential reference for scoring speaking performance.
Resumo:
International audience
Resumo:
Le opere di Federigo Tozzi hanno conosciuto una fortuna critica sempre crescente, nel corso del Novecento, sebbene i testi delle stampe che ancora oggi leggiamo siano segnati dal destino sfortunato del loro autore: Tozzi, morendo a soli trentasette anni, non poté curare né approvare le edizioni di una parte cospicua dei propri scritti. Così, anche il te-sto che costituisce la vulgata di un romanzo caposaldo come Il podere è quello stabilito da Glauco Tozzi, figlio dell’autore, negli anni Sessanta per Vallecchi, a sua volta larga-mente fondato sulla princeps Treves curata dalla vedova dell’autore, Emma Palagi. La presente tesi di dottorato propone l’edizione critica del Podere di Tozzi, inseren-dosi nei lavori per le Edizioni Nazionali cui è destinata. La tesi pertanto è strutturata se-condo una scansione tipica dell’edizione di studio: una nota al testo, contenente le de-scrizioni e le analisi dei testimoni originali, la ricostruzione cronologica della genesi del romanzo e l’ipotesi di lavoro adottata. Dopo la dichiarazione delle norme per la costitu-zione di testo e apparato, segue il testo critico del romanzo, con apparato genetico a piè di pagina. La tesi si conclude con tre appendici, la prima contenente il testo parziale del-le ultime bozze di stampa del romanzo, corredate di apparato che ne descrive le diffe-renze rispetto al testo dattiloscritto e le correzioni d’autore qui aggiunte. Alla seconda appendice è invece destinato il breve testo di Luigia, che è quanto rimane testimoniato di un sequel del Podere. Nella terza appendice sono raccolte le riproduzioni fotografiche di alcune carte originali, utili durante la lettura della nota al testo.