765 resultados para Sentiment Analysis, Opinion Mining, Twitter
Resumo:
En este trabajo aplicamos a la red social Twitter un modelo de análisis del discurso político y mediático desarrollado en publicaciones previas, que permite hacer compatible el estudio de los datos discursivos con propuestas explicativas surgidas a propósito de la comunicación política (neurocomunicación) y de la comunicación digital (la red como quinto estado, convergencia, inteligencia colectiva). Asumimos que hay categorías del encuadre discursivo (frame) que pueden ser tratadas como indicadores de habilidades cognitivas y comunicativas. Analizamos estas categorías agrupándolas en tres dimensiones fundamentales: la intencional (ilocutividad del tuit, encuadre interpretativo de las etiquetas), referencial (temas, protagonistas), e interactiva (alineamiento estructural, predictibilidad; marcas de intertextualidad y dialogismo; afiliación partidista). El corpus consta de 4116 tuits: 3000 tuits pertenecientes a los programas Al Rojo Vivo (La Sexta: A3 Media), Las Mañanas Cuatro (Cuatro: Mediaset) y Los Desayunos de TVE (RTVE), 1116 tuits de seguidores de los programas, que corresponden a 45 tuits de cada programa. Los resultados confirman que el modelo permite establecer diferentes perfiles de subjetividad política en las cuentas de Twitter.
Resumo:
The aim of this study is to determine which social agents are involved in the political debate on Twitter and whether the interpretive hegemony of actors that have traditionally been the most prominent is tempered by the challenge of framing shared with audiences. The relationship between the interpretations expressed and the profiles of participants is analyzed in comparison with the frames used by mainstream media. The chosen methodology combines content analysis and discourse analysis techniques on a sample of 1,504 relevant tweets posted on two political issues –the approval of the education law LOMCE and the evictions caused by the crisis, which have also been studied in the front pages of four leading newspapers in Spain. The results show a correlation between political issue singularities, frames and the type of discussion depending on the participants.
Resumo:
Data mining techniques extract repeated and useful patterns from a large data set that in turn are utilized to predict the outcome of future events. The main purpose of the research presented in this paper is to investigate data mining strategies and develop an efficient framework for multi-attribute project information analysis to predict the performance of construction projects. The research team first reviewed existing data mining algorithms, applied them to systematically analyze a large project data set collected by the survey, and finally proposed a data-mining-based decision support framework for project performance prediction. To evaluate the potential of the framework, a case study was conducted using data collected from 139 capital projects and analyzed the relationship between use of information technology and project cost performance. The study results showed that the proposed framework has potential to promote fast, easy to use, interpretable, and accurate project data analysis.
Resumo:
The research team recognized the value of network-level Falling Weight Deflectometer (FWD) testing to evaluate the structural condition trends of flexible pavements. However, practical limitations due to the cost of testing, traffic control and safety concerns and the ability to test a large network may discourage some agencies from conducting the network-level FWD testing. For this reason, the surrogate measure of the Structural Condition Index (SCI) is suggested for use. The main purpose of the research presented in this paper is to investigate data mining strategies and to develop a prediction method of the structural condition trends for network-level applications which does not require FWD testing. The research team first evaluated the existing and historical pavement condition, distress, ride, traffic and other data attributes in the Texas Department of Transportation (TxDOT) Pavement Maintenance Information System (PMIS), applied data mining strategies to the data, discovered useful patterns and knowledge for SCI value prediction, and finally provided a reasonable measure of pavement structural condition which is correlated to the SCI. To evaluate the performance of the developed prediction approach, a case study was conducted using the SCI data calculated from the FWD data collected on flexible pavements over a 5-year period (2005 – 09) from 354 PMIS sections representing 37 pavement sections on the Texas highway system. The preliminary study results showed that the proposed approach can be used as a supportive pavement structural index in the event when FWD deflection data is not available and help pavement managers identify the timing and appropriate treatment level of preventive maintenance activities.
Resumo:
Twitter is an important and influential social media platform, but much research into its uses remains centred around isolated cases – e.g. of events in political communication, crisis communication, or popular culture, often coordinated by shared hashtags (brief keywords, prefixed with the symbol ‘#’). In particular, a lack of standard metrics for comparing communicative patterns across cases prevents researchers from developing a more comprehensive perspective on the diverse, sometimes crucial roles which hashtags play in Twitter-based communication. We address this problem by outlining a catalogue of widely applicable, standardised metrics for analysing Twitter-based communication, with particular focus on hashtagged exchanges. We also point to potential uses for such metrics, presenting an indication of what broader comparisons of diverse cases can achieve.
Resumo:
Business process analysis and process mining, particularly within the health care domain, remain under-utilised. Applied research that employs such techniques to routinely collected, health care data enables stakeholders to empirically investigate care as it is delivered by different health providers. However, cross-organisational mining and the comparative analysis of processes present a set of unique challenges in terms of ensuring population and activity comparability, visualising the mined models and interpreting the results. Without addressing these issues, health providers will find it difficult to use process mining insights, and the potential benefits of evidence-based process improvement within health will remain unrealised. In this paper, we present a brief introduction on the nature of health care processes; a review of the process mining in health literature; and a case study conducted to explore and learn how health care data, and cross-organisational comparisons with process mining techniques may be approached. The case study applies process mining techniques to administrative and clinical data for patients who present with chest pain symptoms at one of four public hospitals in South Australia. We demonstrate an approach that provides detailed insights into clinical (quality of patient health) and fiscal (hospital budget) pressures in health care practice. We conclude by discussing the key lessons learned from our experience in conducting business process analysis and process mining based on the data from four different hospitals.
Resumo:
The mining industry has positioned itself within the sustainability agenda, particularly since the establishment of the International Council of Mining and Minerals (ICMM). However, some critics have questioned this position, since mining requires the extraction of non-renewable finite resources and commercial mining companies have the specific responsibility to produce profit. Complicating matters is that terms that represent the sustainability such as ‘sustainability’ and ‘sustainable development’ have multiple definitions with varying degrees of sophistication. This work identifies eleven sustainability agenda definitions that are applicable to the mining industry and organises them into three tiers: first, Perpetual Sustainability, that focuses on mining continuing indefinitely with its benefits limited to immediate shareholders; second, Transferable Sustainability, that focuses on how mining can benefit society and the environment and third, Transitional Sustainability, that focuses on the intergenerational benefits to society and the environment even after mining ceases. Using these definitions, a discourse analysis was performed on sustainability reports from member companies of the ICMM and the academic journal Resources Policy. The discourse analysis showed that in both media the definition of the sustainability agenda was focussed on Transferable Sustainability, with the sustainability reports focused on how it can be applied within a business context while the academic journal took a broader view of mining’s social and environmental impacts.
Resumo:
Telecommunications network management is based on huge amounts of data that are continuously collected from elements and devices from all around the network. The data is monitored and analysed to provide information for decision making in all operation functions. Knowledge discovery and data mining methods can support fast-pace decision making in network operations. In this thesis, I analyse decision making on different levels of network operations. I identify the requirements decision-making sets for knowledge discovery and data mining tools and methods, and I study resources that are available to them. I then propose two methods for augmenting and applying frequent sets to support everyday decision making. The proposed methods are Comprehensive Log Compression for log data summarisation and Queryable Log Compression for semantic compression of log data. Finally I suggest a model for a continuous knowledge discovery process and outline how it can be implemented and integrated to the existing network operations infrastructure.
Resumo:
Song-selection and mood are interdependent. If we capture a song’s sentiment, we can determine the mood of the listener, which can serve as a basis for recommendation systems. Songs are generally classified according to genres, which don’t entirely reflect sentiments. Thus, we require an unsupervised scheme to mine them. Sentiments are classified into either two (positive/negative) or multiple (happy/angry/sad/...) classes, depending on the application. We are interested in analyzing the feelings invoked by a song, involving multi-class sentiments. To mine the hidden sentimental structure behind a song, in terms of “topics”, we consider its lyrics and use Latent Dirichlet Allocation (LDA). Each song is a mixture of moods. Topics mined by LDA can represent moods. Thus we get a scheme of collecting similar-mood songs. For validation, we use a dataset of songs containing 6 moods annotated by users of a particular website.
Resumo:
O objetivo deste trabalho é investigar as características da linguagem no Twitter, focalizando (i) seu propósito comunicativo, (ii) seus participantes discursivos e (iii) suas relações interpessoais. Por acreditar que a linguagem é um recurso sistemático e que somente através dela expressamos significados em determinados contextos, encontramos na Linguística Sistêmico-Funcional (LSF) uma base teórica que se encaixa à pesquisa. Para Halliday(1994), a linguística é o estudo de como as pessoas negociam sentido através do uso da linguagem. Assim, encontramos no Twitter, um corpus diversificado que reforça ainda mais a teoria da LSF, quando afirma sermos nós, falantes da língua, os únicos responsáveis por nossas escolhas lexicais, tendo consciência de como e onde, contextualmente falando, podemos aplicar em uma atividade linguística em que estivermos engajados. O material de pesquisa foi constituído mediante a coleta inicial de 671 comentários postados no Twitter em 2010. Dados obtidos a partir da análise desta coleta confirmam o argumento de Crystal (2011), de que a expressão de opinião é o principal propósito comunicativo das mensagens postadas no microblogging. Assim, após recortes no corpus para coleta exclusivamente de opiniões, 201 tuítes resultantes de duas coletas realizadas em datas e situações diferentes foram analisados: uma, após notícia de agressão a uma professora; a segunda, momentos antes e durante a Copa Mundial de 2010. Os resultados apontam diferenças entre as amostras, principalmente em função de aspectos do contexto de situação: pois embora o tom seja de indignação nas amostras com tuítes opinativos, apenas na amostra futebol há tentativa de se orientar a ação do outro. Quanto às relações interpessoais, foram identificadas marcas de interação face a face nas duas amostras, mas apenas na amostra futebol identificou-se uso de linguagem de baixo calão. Finalmente, em relação às características gerais do Twitter, observa-se o uso de linguagem reduzida na forma de caracteres emotivos ou de abreviações, o uso de interjeições e pontos de exclamação. Observou-se ainda o uso recorrente de léxico valorativo, de ironia e de perguntas retóricas para expressão de indignação, mas estes traços parecem ser afetados por aspectos do contexto de situação, mais do que por características do Twitter
Resumo:
This paper focuses on document data, one of the most significant sources for technology intelligence. To help organisations use their knowledge in documents effectively, this research aims to identify what organizations really want from documents and what might be possible to obtain from them. The research involves a literature review, a series of in-depth/on-site interviews and a descriptive analysis of document mining applications. The output of the research includes: a document mining framework; an analysis of the current condition of document mining in technology-based organisations together with their future requirements; and guidelines for introducing document mining into an organisation along with a discussion on the practical issues that are faced by users. Copyright © 2011 Inderscience Enterprises Ltd.
Resumo:
This research proposes a method for extracting technology intelligence (TI) systematically from a large set of document data. To do this, the internal and external sources in the form of documents, which might be valuable for TI, are first identified. Then the existing techniques and software systems applicable to document analysis are examined. Finally, based on the reviews, a document-mining framework designed for TI is suggested and guidelines for software selection are proposed. The research output is expected to support intelligence operatives in finding suitable techniques and software systems for getting value from document-mining and thus facilitate effective knowledge management. Copyright © 2012 Inderscience Enterprises Ltd.
Resumo:
Twitter has changed the dynamic of the academic conference. Before Twitter, delegate participation was primarily dependent on attendance and feedback was limited to post-event survey. With Twitter, delegates have become active participants. They pass comment, share reactions and critique presentations, all the while generating a running commentary. This study examines this phenomenon using the Academic & Special Libraries (A&SL) conference 2015 (hashtag #asl2015) as a case study. A post-conference survey was undertaken asking delegates how and why they used Twitter at #asl2015. A content and conceptual analysis of tweets was conducted using Topsy and Storify. This analysis examined how delegates interacted with presentations, which sessions generated most activity on the timeline and the type of content shared. Actual tweet activity and volume per presentation was compared to survey responses. Finally, recommendations on Twitter engagement for conference organisers and presenters are provided.