59 resultados para Sentiment Analysis Opinion Mining Text Mining Twitter
Resumo:
Advances in hardware and software technology enable us to collect, store and distribute large quantities of data on a very large scale. Automatically discovering and extracting hidden knowledge in the form of patterns from these large data volumes is known as data mining. Data mining technology is not only a part of business intelligence, but is also used in many other application areas such as research, marketing and financial analytics. For example medical scientists can use patterns extracted from historic patient data in order to determine if a new patient is likely to respond positively to a particular treatment or not; marketing analysts can use extracted patterns from customer data for future advertisement campaigns; finance experts have an interest in patterns that forecast the development of certain stock market shares for investment recommendations. However, extracting knowledge in the form of patterns from massive data volumes imposes a number of computational challenges in terms of processing time, memory, bandwidth and power consumption. These challenges have led to the development of parallel and distributed data analysis approaches and the utilisation of Grid and Cloud computing. This chapter gives an overview of parallel and distributed computing approaches and how they can be used to scale up data mining to large datasets.
Resumo:
This article examines Corporate Social Responsibility (CSR) and mining community development, sustainability and viability. These issues are considered focussing on current and former company-owned mining towns in Namibia. Historically company towns have been a feature of mining activity in Namibia. However, the fate of such towns upon mine closure has been and remains controversial. Declining former mining communities and even ghost mining towns can be found across the country. This article draws upon research undertaken in Namibia and considers these issues with reference to three case study communities. This article examines the complexities which surround decision-making about these communities, and the challenges faced in efforts to encourage their sustainability after mining. In this article, mine company engagements through CSR with the development, sustainability and viability of such communities are also critically discussed. The role, responsibilities, and actions of the state in relation to these communities are furthermore reflected upon. Finally, ways forward for these communities are considered.
Resumo:
n the past decade, the analysis of data has faced the challenge of dealing with very large and complex datasets and the real-time generation of data. Technologies to store and access these complex and large datasets are in place. However, robust and scalable analysis technologies are needed to extract meaningful information from these datasets. The research field of Information Visualization and Visual Data Analytics addresses this need. Information visualization and data mining are often used complementary to each other. Their common goal is the extraction of meaningful information from complex and possibly large data. However, though data mining focuses on the usage of silicon hardware, visualization techniques also aim to access the powerful image-processing capabilities of the human brain. This article highlights the research on data visualization and visual analytics techniques. Furthermore, we highlight existing visual analytics techniques, systems, and applications including a perspective on the field from the chemical process industry.
Resumo:
This article examines the marginal position of artisanal miners in sub-Saharan Africa, and considers how they are incorporated into mineral sector change in the context of institutional and legal integration. Taking the case of diamond and gold mining in Tanzania, the concept of social exclusion is used to explore the consequences of marginalization on people's access to mineral resources and ability to make a living from artisanal mining. Because existing inequalities and forms of discrimination are ignored by the Tanzanian state, the institutionalization of mineral titles conceals social and power relations that perpetuate highly unequal access to resources. The article highlights the complexity of these processes, and shows that while legal integration can benefit certain wealthier categories of people, who fit into the model of an 'entrepreneurial small-scale miner', for others adverse incorporation contributes to socio-economic dependence, exploitation and insecurity. For the issue of marginality to be addressed within integration processes, the existence of local forms of organization, institutions and relationships, which underpin inequalities and discrimination, need to be recognized.
Resumo:
In the context of environmental valuation of natural disasters, an important component of the evaluation procedure lies in determining the periodicity of events. This paper explores alternative methodologies for determining such periodicity, illustrating the advantages and the disadvantages of the separate methods and their comparative predictions. The procedures employ Bayesian inference and explore recent advances in computational aspects of mixtures methodology. The procedures are applied to the classic data set of Maguire et al (Biometrika, 1952) which was subsequently updated by Jarrett (Biometrika, 1979) and which comprise the seminal investigations examining the periodicity of mining disasters within the United Kingdom, 1851-1962.
Resumo:
Owing to continuous advances in the computational power of handheld devices like smartphones and tablet computers, it has become possible to perform Big Data operations including modern data mining processes onboard these small devices. A decade of research has proved the feasibility of what has been termed as Mobile Data Mining, with a focus on one mobile device running data mining processes. However, it is not before 2010 until the authors of this book initiated the Pocket Data Mining (PDM) project exploiting the seamless communication among handheld devices performing data analysis tasks that were infeasible until recently. PDM is the process of collaboratively extracting knowledge from distributed data streams in a mobile computing environment. This book provides the reader with an in-depth treatment on this emerging area of research. Details of techniques used and thorough experimental studies are given. More importantly and exclusive to this book, the authors provide detailed practical guide on the deployment of PDM in the mobile environment. An important extension to the basic implementation of PDM dealing with concept drift is also reported. In the era of Big Data, potential applications of paramount importance offered by PDM in a variety of domains including security, business and telemedicine are discussed.
Resumo:
The paper investigates how energy-intensive industries respond to the recent government-led carbon emission schemes through the content analysis of 306 annual and standalone reports of 25 UK listed companies from 2004 to 2012. This period of reporting captures the trend and development of corporate disclosures on carbon emissions after the launch of EU Emissions Trading Schemes (ETS) and Climate Change Act (CCA) 2008. It is found that in corresponding to strategic legitimacy theory, there is an increase in both the quality and quantity of carbon disclosures as a response to these initiatives. However, the change is gradual, which reflects in the achievement of peak disclosure period two years after the launch. It indicates that the new legislations have a lasting impact on the discourses rather than an immediate legitimacy threat from the perspective of institutional legitimacy theory. The results also show that carbon disclosures are an institutionalised practice as companies in the same industries and/or with same carbon trading account status appear to imitate and adopt the industry’s ‘best practice’ disclosure strategy to maintain legitimacy. The trend analysis suggests that the overall disclosure practice is still in its infant stage, especially in the reporting of quantitative and monetary items. The paper contributes to the social and environmental accounting literature by adopting both strategic and institutional view of legitimacy, which explains why carbon disclosures evolve in a specific way to meet the expectation of various stakeholders.
Resumo:
Twitter is both a micro-blogging service and a platform for public conversation. Direct conversation is facilitated in Twitter through the use of @’s (mentions) and replies. While the conversational element of Twitter is of particular interest to the marketing sector, relatively few data-mining studies have focused on this area. We analyse conversations associated with reciprocated mentions that take place in a data-set consisting of approximately 4 million tweets collected over a period of 28 days that contain at least one mention. We ignore tweet content and instead use the mention network structure and its dynamical properties to identify and characterise Twitter conversations between pairs of users and within larger groups. We consider conversational balance, meaning the fraction of content contributed by each party. The goal of this work is to draw out some of the mechanisms driving conversation in Twitter, with the potential aim of developing conversational models.
Resumo:
For its advocates, corporate social responsibility (CSR) represents a powerful tool through which business and particularly multinationals can play a more direct role in global sustainable development. For its critics, however, CSR rarely goes beyond business as usual, and is often a cover for business practices with negative implications for communities and the environment. This paper explores the relationship between CSR and sustainable development in the context of mining in Namibia. Drawing upon extant literatures on the geographies of responsibility, and referencing in-country empirical case-study research, a critical relational lens is applied to consider their interaction both historically and in the present.
Resumo:
This paper addresses the economics of Enhanced Landfill Mining (ELFM) both from a private point of view as well as from a society perspective. The private potential is assessed using a case study for which an investment model is developed to identify the impact of a broad range of parameters on the profitability of ELFM. We found that especially variations in Waste-to-Energy (WtE efficiency, electricity price, CO2-price, WtE investment and operational costs) and ELFM support explain the variation in economic profitability measured by the Internal Rate of Return. To overcome site-specific parameters we also evaluated the regional ELFM potential for the densely populated and industrial region of Flanders (north of Belgium). The total number of potential ELFM sites was estimated using a 5-step procedure and a simulation tool was developed to trade-off private costs and benefits. The analysis shows that there is a substantial economic potential for ELFM projects on the wider regional level. Furthermore, this paper also reviews the costs and benefits from a broader perspective. The carbon footprint of the case study was mapped in order to assess the project’s net impact in terms of greenhouse gas emissions. Also the impacts of nature restoration, soil remediation, resource scarcity and reduced import dependence were valued so that they can be used in future social cost-benefit analysis. Given the complex trade-off between economic, social and environmental issues of ELFM projects, we conclude that further refinement of the methodological framework and the development of the integrated decision tools supporting private and public actors, are necessary.
Resumo:
Classical regression methods take vectors as covariates and estimate the corresponding vectors of regression parameters. When addressing regression problems on covariates of more complex form such as multi-dimensional arrays (i.e. tensors), traditional computational models can be severely compromised by ultrahigh dimensionality as well as complex structure. By exploiting the special structure of tensor covariates, the tensor regression model provides a promising solution to reduce the model’s dimensionality to a manageable level, thus leading to efficient estimation. Most of the existing tensor-based methods independently estimate each individual regression problem based on tensor decomposition which allows the simultaneous projections of an input tensor to more than one direction along each mode. As a matter of fact, multi-dimensional data are collected under the same or very similar conditions, so that data share some common latent components but can also have their own independent parameters for each regression task. Therefore, it is beneficial to analyse regression parameters among all the regressions in a linked way. In this paper, we propose a tensor regression model based on Tucker Decomposition, which identifies not only the common components of parameters across all the regression tasks, but also independent factors contributing to each particular regression task simultaneously. Under this paradigm, the number of independent parameters along each mode is constrained by a sparsity-preserving regulariser. Linked multiway parameter analysis and sparsity modeling further reduce the total number of parameters, with lower memory cost than their tensor-based counterparts. The effectiveness of the new method is demonstrated on real data sets.
Resumo:
Strategies to Reduce Emissions from Deforestation and Degradation (REDD) are being pursued in numerous developing countries. Proponents contest that REDD mechanisms could deliver sustainable development by contributing to both environmental protection and economic development, particularly in poor forest communities. However, among the challenges to REDD, and natural resource management more generally, is the need to develop a comprehensive understanding of cross-sectoral linkages and addressing how they impact the pursuit of sustainable development. Drawing on an exploratory case-study of Ghana, this paper aims to outline the linkages between the forestry and minerals sectors. It is argued that contemporary debates give incommensurate attention to the reclamation of large-scale mine sites located in forest reserves, and neglect to consider more nuanced links which characterise the forestry-mining nexus in Ghana. A review of key stakeholders further elucidates the complex networks which characterise these linkages and highlights the important role of traditional authorities in governing across sectors. If the multiple roles of local resource users and traditional authorities continue to be neglected in policy mechanisms, schemes such as REDD will continue to fall short of achieving sustainable development.
Resumo:
This paper provides an interdisciplinary perspective on mine reclamation in forested areas of Ghana, a country characterised by conflicts between mining and forest conservation. A comparison was made between above ground biomass (AGB) and soil organic carbon (SOC) content from two reclaimed mine sites and adjacent undisturbed forest. Findings suggest that on decadal timescales, reclaimed mine sites contain approximately 40% of the total carbon and 10% the AGB carbon of undisturbed forest. This raises questions regarding the potential for decommissioning mine sites to provide forestry-based legacies. Such a move could deliver a host of benefits, including improving the longevity and success of reclamation, mitigating climate change and delivering corollary enumeration for local communities under carbon trading schemes. A discussion of the antecedents and challenges associated with establishing forest-legacies highlights the risk of neglecting the participation and heterogeneity of legitimate local representatives, which threatens the equity of potential benefits and sustainability of projects. Despite these risks, implementing pilot projects could help to address the lack of transparency and data which currently characterises mine reclamation.
Resumo:
The General Election for the 56th United Kingdom Parliament was held on 7 May 2015. Tweets related to UK politics, not only those with the specific hashtag ”#GE2015”, have been collected in the period between March 1 and May 31, 2015. The resulting dataset contains over 28 million tweets for a total of 118 GB in uncompressed format or 15 GB in compressed format. This study describes the method that was used to collect the tweets and presents some analysis, including a political sentiment index, and outlines interesting research directions on Big Social Data based on Twitter microblogging.