Biblioteca Digital

20 resultados para Mine sanitation

Mining sentiments from songs using latent dirichlet allocation

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Song-selection and mood are interdependent. If we capture a song’s sentiment, we can determine the mood of the listener, which can serve as a basis for recommendation systems. Songs are generally classified according to genres, which don’t entirely reflect sentiments. Thus, we require an unsupervised scheme to mine them. Sentiments are classified into either two (positive/negative) or multiple (happy/angry/sad/...) classes, depending on the application. We are interested in analyzing the feelings invoked by a song, involving multi-class sentiments. To mine the hidden sentimental structure behind a song, in terms of “topics”, we consider its lyrics and use Latent Dirichlet Allocation (LDA). Each song is a mixture of moods. Topics mined by LDA can represent moods. Thus we get a scheme of collecting similar-mood songs. For validation, we use a dataset of songs containing 6 moods annotated by users of a particular website.

Veja mais

Urban water supply and management

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Population growth and rapid urbanization lead to considerable stress on already depleting water resources. A great challenge for water authorities of urban cities is to supply adequate and reliable safe water to all consumers. In most of the developing countries water scarcity and high demands have led the water authorities to resort to intermittent supplies. Surface and groundwater are the major sources of supply in urban cities. The direct consequences of intermittent supplies and poor sanitation practices are several incidences of water borne diseases posing public health risk. In order to minimize the supply-demand gap and to assure good quality of water, new techniques or models can be helpful to manage the water distribution systems (WDS) in a better way. In the present paper, a review is carried out on the existing urban water supply management methodologies with a way forward for the proper management of the water supply systems.

Veja mais

Discovering Math APIs by Mining Unit Tests

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In today's API-rich world, programmer productivity depends heavily on the programmer's ability to discover the required APIs. In this paper, we present a technique and tool, called MATHFINDER, to discover APIs for mathematical computations by mining unit tests of API methods. Given a math expression, MATHFINDER synthesizes pseudo-code to compute the expression by mapping its subexpressions to API method calls. For each subexpression, MATHFINDER searches for a method such that there is a mapping between method inputs and variables of the subexpression. The subexpression, when evaluated on the test inputs of the method under this mapping, should produce results that match the method output on a large number of tests. We implemented MATHFINDER as an Eclipse plugin for discovery of third-party Java APIs and performed a user study to evaluate its effectiveness. In the study, the use of MATHFINDER resulted in a 2x improvement in programmer productivity. In 96% of the subexpressions queried for in the study, MATHFINDER retrieved the desired API methods as the top-most result. The top-most pseudo-code snippet to implement the entire expression was correct in 93% of the cases. Since the number of methods and unit tests to mine could be large in practice, we also implement MATHFINDER in a MapReduce framework and evaluate its scalability and response time.

Veja mais

Monitoring urbanization and its implications in a mega city from space: Spatiotemporal patterns and its indicators

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Rapid and invasive urbanization has been associated with depletion of natural resources (vegetation and water resources), which in turn deteriorates the landscape structure and conditions in the local environment. Rapid increase in population due to the migration from rural areas is one of the critical issues of the urban growth. Urbanisation in India is drastically changing the land cover and often resulting in the sprawl. The sprawl regions often lack basic amenities such as treated water supply, sanitation, etc. This necessitates regular monitoring and understanding of the rate of urban development in order to ensure the sustenance of natural resources. Urban sprawl is the extent of urbanization which leads to the development of urban forms with the destruction of ecology and natural landforms. The rate of change of land use and extent of urban sprawl can be efficiently visualized and modelled with the help of geo-informatics. The knowledge of urban area, especially the growth magnitude, shape geometry, and spatial pattern is essential to understand the growth and characteristics of urbanization process. Urban pattern, shape and growth can be quantified using spatial metrics. This communication quantifies the urbanisation and associated growth pattern in Delhi. Spatial data of four decades were analysed to understand land over and land use dynamics. Further the region was divided into 4 zones and into circles of 1 km incrementing radius to understand and quantify the local spatial changes. Results of the landscape metrics indicate that the urban center was highly aggregated and the outskirts and the buffer regions were in the verge of aggregating urban patches. Shannon's Entropy index clearly depicted the outgrowth of sprawl areas in different zones of Delhi. (C) 2014 Elsevier Ltd. All rights reserved.

Veja mais

Association Rule Sharing Model for Privacy Preservation and Collaborative Data Mining Efficiency

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The disclosure of information and its misuse in Privacy Preserving Data Mining (PPDM) systems is a concern to the parties involved. In PPDM systems data is available amongst multiple parties collaborating to achieve cumulative mining accuracy. The vertically partitioned data available with the parties involved cannot provide accurate mining results when compared to the collaborative mining results. To overcome the privacy issue in data disclosure this paper describes a Key Distribution-Less Privacy Preserving Data Mining (KDLPPDM) system in which the publication of local association rules generated by the parties is published. The association rules are securely combined to form the combined rule set using the Commutative RSA algorithm. The combined rule sets established are used to classify or mine the data. The results discussed in this paper compare the accuracy of the rules generated using the C4. 5 based KDLPPDM system and the CS. 0 based KDLPPDM system using receiver operating characteristics curves (ROC).

Veja mais

20 resultados para Mine sanitation

Filtro por publicador

Mining sentiments from songs using latent dirichlet allocation

Urban water supply and management

Discovering Math APIs by Mining Unit Tests

Monitoring urbanization and its implications in a mega city from space: Spatiotemporal patterns and its indicators

Association Rule Sharing Model for Privacy Preservation and Collaborative Data Mining Efficiency