Digital Library

765 results for Sentiment Analysis, Opinion Mining, Twitter

Originator or propagator? Incorporating social role theory into topic models for twitter content analysis

Relevance: 40.00%

Abstract:

A large number of studies have been devoted to modeling the contents and interactions between users on Twitter. In this paper, we propose a method inspired by Social Role Theory (SRT), which assumes that a user behaves differently in different roles in the generation process of Twitter content. We consider the two most distinctive social roles on Twitter: originator and propagator, who respectively post original messages and retweet or forward messages from others. In addition, we consider role-specific social interactions, especially implicit interactions between users who share common interests. All of these elements are integrated into a novel regularized topic model. We evaluate the proposed method on real Twitter data. The results show that our method is more effective than existing ones that do not distinguish social roles. Copyright 2013 ACM.

Analysis and identification of spamming behaviors in Sina Weibo microblog

Relevance: 40.00%

Abstract:

Spamming has been a widespread problem for social networks. In recent years there has been increasing interest in the analysis of anti-spamming for microblogs such as Twitter. In this paper we present a systematic study of spamming on the Sina Weibo platform, which is currently a dominant microblogging service provider in China. Our research objectives are to understand the specific spamming behaviors in Sina Weibo and to find approaches to identify and block spammers in Sina Weibo based on spamming behavior classifiers. To start the analysis of spamming behaviors, we devise several effective methods to collect a large set of spammer samples, including the use of proactive honeypots and crawlers, keyword-based searching, and buying spammer samples directly from online merchants. We processed the database associated with these spammer samples and found three representative spamming behaviors: aggressive advertising, repeated duplicate reposting, and aggressive following. We extract various features and compare the behaviors of spammers and legitimate users with regard to these features. We find that spamming behaviors and normal behaviors have distinct characteristics. Based on these findings we design an automatic online spammer identification system. Tests with real data demonstrate that the system can effectively detect the spamming behaviors and identify spammers in Sina Weibo.

Sentiment-Topic Modelling in Text Mining

Relevance: 40.00%

Abstract:

Peer reviewed

UTILIZATION OF GIS AND SPATIAL ANALYSIS TECHNIQUES IN ARTISANAL AND SMALL SCALE MINING TO LOCATE A CENTRALIZED PROCESSING CENTRE

Relevance: 40.00%

Abstract:

Artisanal mining is a global phenomenon that threatens environmental health and safety. There are ambiguities in the manner in which an ore-processing facility operates, which hinder the mining capacity of these miners in Ghana. These problems are reviewed on the basis of current socio-economic, health and safety, and environmental conditions, and of the use of rudimentary technologies, which limit fair-trade deals for miners. This research sought to use an established data-driven, geographic information system (GIS)-based approach employing spatial analysis to locate a centralized processing facility within the Wassa Amenfi-Prestea Mining Area (WAPMA) in the Western region of Ghana. A spatial analysis technique that utilizes ModelBuilder within the ArcGIS geoprocessing environment through suitability modeling will systematically and simultaneously analyze a geographical dataset of selected criteria. The spatial overlay analysis methodology and the multi-criteria decision analysis approach were selected to identify the most preferred locations to site a processing facility. For an optimal site selection, seven major criteria, namely proximity to settlements, water resources, artisanal mining sites, roads, railways, tectonic zones, and slopes, were considered to establish a suitable location for a processing facility. Site characterizations and environmental considerations were also selected, incorporating identified constraints such as proximity to large-scale mines, forest reserves, and state lands. The analysis was limited to criteria that were selected and relevant to the area under investigation. Saaty’s analytical hierarchy process was utilized to derive relative importance weights of the criteria, and then a weighted linear combination technique was applied to combine the factors to determine the degree of potential site suitability.
The final map output indicates estimated potential sites identified for the establishment of a facility centre. The results obtained provide intuitive areas suitable for consideration
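As a rough illustration of the weighting step described in this abstract, the sketch below derives criterion weights from a pairwise comparison matrix and combines criterion scores by weighted linear combination. It is not the authors' ArcGIS workflow: the three-criterion matrix, the cell scores, and the function names are hypothetical, and the row geometric-mean method is used here as a common approximation to Saaty's principal-eigenvector weights.

```python
import math


def ahp_weights(pairwise):
    """Approximate AHP priority weights with the row geometric-mean method."""
    n = len(pairwise)
    geo_means = [math.prod(row) ** (1.0 / n) for row in pairwise]
    total = sum(geo_means)
    return [g / total for g in geo_means]


def weighted_linear_combination(weights, scores):
    """Combine standardized criterion scores (0..1) into one suitability value."""
    return sum(w * s for w, s in zip(weights, scores))


# Hypothetical comparison of three criteria (values are illustrative only):
# entry [i][j] says how much more important criterion i is than criterion j.
pairwise = [
    [1.0, 3.0, 5.0],
    [1 / 3, 1.0, 2.0],
    [1 / 5, 1 / 2, 1.0],
]
weights = ahp_weights(pairwise)

# Hypothetical standardized scores of one map cell on the same three criteria.
cell_scores = [0.8, 0.4, 0.6]
suitability = weighted_linear_combination(weights, cell_scores)

print([round(w, 3) for w in weights], round(suitability, 3))
```

In a raster setting the same combination would be applied to every cell, with constraint layers masking out excluded areas.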

Data Mining for Network Intrusion Detection : A comparison of data mining algorithms and an analysis of relevant features for detecting cyber-attacks

Relevance: 40.00%

Abstract:

Data mining can be defined as the extraction of implicit, previously unknown, and potentially useful information from data. Numerous researchers have been developing security technology and exploring new methods to detect cyber-attacks with the DARPA 1998 dataset for Intrusion Detection and its modified versions, KDDCup99 and NSL-KDD, but until now no one has examined the performance of the Top 10 data mining algorithms selected by experts in data mining. The classification learning algorithms compared in this thesis are C4.5, CART, k-NN and Naïve Bayes. The performance of these algorithms is compared in terms of accuracy, error rate and average cost on modified versions of the NSL-KDD train and test datasets, where the instances are classified into normal and four cyber-attack categories: DoS, Probing, R2L and U2R. Additionally, the most important features for detecting cyber-attacks in all categories and in each category are evaluated with Weka’s Attribute Evaluator and ranked according to Information Gain. The results show that the classification algorithm with the best performance on the dataset is the k-NN algorithm. The most important features for detecting cyber-attacks are basic features such as the number of seconds of a network connection, the protocol used for the connection, the network service used, the normal or error status of the connection and the number of data bytes sent. For DoS, Probing and R2L attacks, the most important features are basic features and the least important are content features. For U2R attacks, in contrast, the content features are the most important.
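To make the ranking criterion in this abstract concrete, the following minimal sketch computes Information Gain for categorical features and ranks them. It is an illustration only, not Weka's implementation: the function names and the four toy records are assumptions, and numeric features would first need discretization, which this sketch does not do.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())


def information_gain(feature_values, labels):
    """Entropy of the labels minus their entropy after splitting on the feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Toy records (made up for illustration, not taken from NSL-KDD).
labels = ["normal", "normal", "dos", "dos"]
features = {
    "protocol": ["tcp", "tcp", "icmp", "icmp"],  # separates the classes
    "flag": ["SF", "S0", "SF", "S0"],            # carries no class information
}

# Rank features from most to least informative.
ranking = sorted(features,
                 key=lambda name: information_gain(features[name], labels),
                 reverse=True)
print(ranking)
```

A feature that splits the classes perfectly recovers the full label entropy, while one that is independent of the class scores zero, which is what makes the measure usable for ranking.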

No Joke: Understanding Public Sentiment Toward Selling and Salespeople Through Cartoon Analysis

Relevance: 40.00%

Abstract:

Unflattering representations of salesmanship in mass media exist in abundance. In order to gauge the depiction of selling in mass media, this article explores the nature and public perceptions of salesmanship using editorial cartoons. A theory of cartooning suggests that editorial cartoons reflect public sentiment toward events and issues and therefore provide a useful way of measuring and tracking such sentiment over time. The criteria of narrative, location, binary struggle, normative transference, and metaphor were used as a framework to analyze 286 cartoons over a 30-year period from 1983 to 2013. The results suggest that while representations of the characteristics and behaviors of salespeople shifted very little across time periods, changes in public perceptions of seller–buyer conflict, the role of the customer, and selling techniques were observed, thus indicating that cartoons are sensitive enough to measure the portrayal of selling.

Contextual text mining

Relevance: 40.00%

Abstract:

With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the author(s) of a biomedical publication, or implicit, such as the positive or negative sentiment that an author had when she wrote a product review; there may also be complex context such as the social network of the authors. Many applications require analysis of topic patterns over different contexts. For instance, analysis of search logs in the context of the user can reveal how we can improve the quality of a search engine by optimizing the search results according to particular users; analysis of customer reviews in the context of positive and negative sentiments can help the user summarize public opinions about a product; analysis of blogs or scientific publications in the context of a social network can facilitate discovery of more meaningful topical communities. Since context information significantly affects the choices of topics and language made by authors, it is very important to incorporate it into analyzing and mining text data. Modeling the context in text and discovering contextual patterns of language units and topics from text, a general task which we refer to as Contextual Text Mining, has widespread applications in text mining. In this thesis, we provide a novel and systematic study of contextual text mining, which is a new paradigm of text mining treating context information as the "first-class citizen." We formally define the problem of contextual text mining and its basic tasks, and propose a general framework for contextual text mining based on generative modeling of text.
This conceptual framework provides general guidance on text mining problems with context information and can be instantiated into many real tasks, including the general problem of contextual topic analysis. We formally present a functional framework for contextual topic analysis, with a general contextual topic model and its various versions, which can effectively solve the text mining problems in many real-world applications. We further introduce general components of contextual topic analysis, by adding priors to contextual topic models to incorporate prior knowledge, regularizing contextual topic models with dependency structure of context, and postprocessing contextual patterns to extract refined patterns. The refinements on the general contextual topic model naturally lead to a variety of probabilistic models which incorporate different types of context and various assumptions and constraints. These special versions of the contextual topic model prove effective in a variety of real applications involving topics and explicit contexts, implicit contexts, and complex contexts. We then introduce a postprocessing procedure for contextual patterns, by generating meaningful labels for multinomial context models. This method provides a general way to interpret text mining results for real users. By applying contextual text mining in the "context" of other text information management tasks, including ad hoc text retrieval and web search, we further demonstrate the effectiveness of contextual text mining techniques quantitatively on large-scale datasets. The framework of contextual text mining not only unifies many explorations of text analysis with context information, but also opens up many new possibilities for future research directions in text mining.

Using text-mining-assisted analysis to examine the applicability of unstructured data in the context of customer complaint management

Relevance: 40.00%

Abstract:

Double Degree

Analysis of the criticality of flaws found in trunnion of grinding ball mills used in mining plants

Relevance: 40.00%

The frustrated accesion of the European Union to the European Convention of Human Rights: a brief analysis of opinion 2/13 of the Court of Justice of European Union

Relevance: 40.00%

Abstract:

Paper written in English on the latest ruling, Opinion 2/13, of the Court of Justice of the European Union on the accession of the European Union to the European Convention on Human Rights. It analyzes Opinion 2/13 and its objections.

Political ideologies and attitudes towards income inequality in the US: a critical discourse analysis of CNN and Fox News opinion articles

Relevance: 40.00%

Abstract:

The topic of the thesis is media discourse about the current state of income inequality in the US, and political ideologies as influences behind the discourse. The data consists of four opinion articles, two from CNN and two from Fox News. The purpose of the study was to examine how media represents income inequality as an issue, and whether the attitudes conveyed are concerned or indifferent. Previous studies have indicated that the level of income is often seen as a personal responsibility, and such a perspective can be linked with Republican ideology. In contrast, the Democrats typically express more concern about the consequences of inequality. CNN has previously been considered to have a Democratic bias, and Fox News a Republican bias, which is one reason why these two news channels were chosen as the sources of the data. The study is a critical discourse analysis, and the methods applied were the sociocognitive approach, which analyzes the social and cognitive factors affecting the discourse, and the appraisal framework, which was applied to scrutinize the expressed attitudes more closely by identifying specific linguistic features. The appraisal framework includes studying such features as affect, judgment and appreciation, which offer a more detailed analysis of the attitudes present in the articles. The sociocognitive approach, additionally, offers a way of analyzing the broader context affecting the articles. The findings were then compared to see if there are differences between the articles, or between the news sites with alleged bias. The findings showed that CNN, with alleged Democratic bias, had a more sympathetic attitude towards income inequality, whereas Fox News, with more Republican views, showed clearly less concern towards the issue.
Moreover, the Fox News articles made such dubious claims that the underlying ideology behind the articles could even be supportive of income inequality, as it allows the rich to pursue all the wealth they can without having to give anything away. The results thus suggest that political ideologies may have a significant effect on media discourse, which, in turn, may have a significant effect on the attitudes of the public towards major issues that could require prompt measures.

Business Process Management and Process Mining within a Real Business Environment: An Empirical Analysis of Event Logs Data in a Consulting Project

Relevance: 40.00%

Abstract:

This thesis explores the attitude of organizations toward the business processes that sustain them: from the near absence of structure, to functional organization, up to the advent of Business Process Reengineering and of Business Process Management, the latter born to overcome the limits and problems of the preceding model. Within the BPM life cycle sits the methodology of process mining, which enables a level of process analysis starting from event data logs, that is, the records of events relating to all the activities supported by an enterprise information system. Process mining can be seen as a natural bridge connecting process-based (but not data-driven) management disciplines with new developments in business intelligence, which can handle and manipulate the enormous volume of data available to companies (but are not process-driven). The thesis describes the requirements and technologies that enable the use of the discipline, as well as the three techniques it enables: process discovery, conformance checking and process enhancement. Process mining was used as the main tool in a consulting project by HSPI S.p.A. on behalf of a major Italian client, a provider of IT platforms and solutions. The project in which I took part, described in the thesis, aims to support the organization in its plan to improve internal performance, and made it possible to verify the applicability and limits of process mining techniques. Finally, the appendix contains a paper I wrote that collects all the applications of the discipline in real business contexts, drawing data and information from working papers, business cases and direct channels. For its validity and completeness, this document was published on the website of the IEEE Task Force on Process Mining.

Text Based Knowledge Discovery with Information Flow Analysis

Relevance: 30.00%
