970 resultados para 650200 Mining and Extraction


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The rapid growth of biological databases not only provides biologists with abundant data but also presents a big challenge in relation to the analysis of data. Many data analysis approaches such as data mining, information retrieval and machine learning have been used to extract frequent patterns from diverse biological databases. However, the discrepancies, due to the differences in the structure of databases and their terminologies, result in a significant lack of interoperability. Although ontology-based approaches have been used to integrate biological databases, the inconsistent analysis of biological databases has been greatly disregarded. This paper presents a method by which to measure the degree of inconsistency between biological databases. It not only presents a guideline for correct and efficient database integration, but also exposes high quality data for data mining and knowledge discovery.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Determining the causal relation among attributes in a domain
is a key task in the data mining and knowledge discovery. In this
paper, we applied a causal discovery algorithm to the business traveler
expenditure survey data [1]. A general class of causal models is adopted in
this paper to discover the causal relationship among continuous and discrete variables. All those factors which have direct effect on the expense
pattern of travelers could be detected. Our discovery results reinforced
some conclusions of the rough set analysis and found some new conclusions which might significantly improve the understanding of expenditure behaviors of the business traveler.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We present an agent-based system Intelligent Financial News Digest System (IFNDS) for analyzing online financial news articles and associated material. The system can abstract, synthesize, digest, and classify the contents, and assesses whether the report is favorable to any company discussed in the reports. It integrates artificial intelligence technologies including traditional information retrieval and extraction techniques for the news analysis. It makes use of keyword statistics and backpropagation training data to identify companies named in reportage whether it is, evaluatively speaking, positive, negative or neutral. The system would be of use to media such as clipping services, media management, advertising, public relations, public interest, and e-commerce professionals and government non-governmental bodies interested in monitoring the media profiles of corporations, products, and issues.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper presents new methodology towards the automatic development of multilingual Web portal for multilingual knowledge discovery and management. It aims to provide an efficient and effective framework for selecting and organizing knowledge from voluminous linguistically diverse Web contents. To achieve this, a concept-based approach that incorporates text mining and Web content mining using neural network and fuzzy techniques is proposed. First, a concept-based taxonomy of themes, which will act as the hierarchical backbone of the Web portal, is automatically generated. Second, a concept-based multilingual Web crawler is developed to intelligently harvest relevant multilingual documents from the Web. Finally, a concept-based multilingual text categorization technique is proposed to organize multilingual documents by concepts. As such, correlated multilingual Web documents can be gathered/filtered/organised/ based on their semantic content to facilitate high-performance multilingual information access.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The International Multimedia Modeling conference series is an annual forum to discuss the efficient representation, processing, interaction, integration, communication, and retrieval of multimedia information.
In particular, the 11th International Multimedia Modeling Conference (MMM2005) concentrates on common modeling frameworks for integrating the diverse fields of visual, audio, video, and virtual world information.
MMM2005 deals with emerging Multimedia Modeling topics that include:
• Audio Analysis and Modeling
• Video Manipulation and Modeling
• Video Mining and MPEG
• Image Modeling and Editing
• Image Retrieval
• Multimedia Presentation and Knowledge Sharing
• AI and Image Recognition
• Mobile and Virtual Multimedia Environments

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This thesis provides a unified and comprehensive treatment of the fuzzy neural networks as the intelligent controllers. This work has been motivated by a need to develop the solid control methodologies capable of coping with the complexity, the nonlinearity, the interactions, and the time variance of the processes under control. In addition, the dynamic behavior of such processes is strongly influenced by the disturbances and the noise, and such processes are characterized by a large degree of uncertainty. Therefore, it is important to integrate an intelligent component to increase the control system ability to extract the functional relationships from the process and to change such relationships to improve the control precision, that is, to display the learning and the reasoning abilities. The objective of this thesis was to develop a self-organizing learning controller for above processes by using a combination of the fuzzy logic and the neural networks. An on-line, direct fuzzy neural controller using the process input-output measurement data and the reference model with both structural and parameter tuning has been developed to fulfill the above objective. A number of practical issues were considered. This includes the dynamic construction of the controller in order to alleviate the bias/variance dilemma, the universal approximation property, and the requirements of the locality and the linearity in the parameters. Several important issues in the intelligent control were also considered such as the overall control scheme, the requirement of the persistency of excitation and the bounded learning rates of the controller for the overall closed loop stability. Other important issues considered in this thesis include the dependence of the generalization ability and the optimization methods on the data distribution, and the requirements for the on-line learning and the feedback structure of the controller. Fuzzy inference specific issues such as the influence of the choice of the defuzzification method, T-norm operator and the membership function on the overall performance of the controller were also discussed. In addition, the e-completeness requirement and the use of the fuzzy similarity measure were also investigated. Main emphasis of the thesis has been on the applications to the real-world problems such as the industrial process control. The applicability of the proposed method has been demonstrated through the empirical studies on several real-world control problems of industrial complexity. This includes the temperature and the number-average molecular weight control in the continuous stirred tank polymerization reactor, and the torsional vibration, the eccentricity, the hardness and the thickness control in the cold rolling mills. Compared to the traditional linear controllers and the dynamically constructed neural network, the proposed fuzzy neural controller shows the highest promise as an effective approach to such nonlinear multi-variable control problems with the strong influence of the disturbances and the noise on the dynamic process behavior. In addition, the applicability of the proposed method beyond the strictly control area has also been investigated, in particular to the data mining and the knowledge elicitation. When compared to the decision tree method and the pruned neural network method for the data mining, the proposed fuzzy neural network is able to achieve a comparable accuracy with a more compact set of rules. In addition, the performance of the proposed fuzzy neural network is much better for the classes with the low occurrences in the data set compared to the decision tree method. Thus, the proposed fuzzy neural network may be very useful in situations where the important information is contained in a small fraction of the available data.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In the mining and analysis of a single long sequence, one fundamental and important problem is obtaining accurate frequencies of sequential patterns over the sequence. However, we identify that five previous frequency measures suffer from inherent inaccuracies. To obtain more accurate frequencies, we introduce two basic principles called strict anti-monotonicity and maximum-count for frequency measures. Under the two principles, a new frequency measure is presented. An algorithm is also devised to compute it. Both theoretical analysis and empirical evaluation show that more accurate frequencies can be obtained under the new measure

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Human associated delay-tolerant networks (HDTNs) are new networks for DTNs, where mobile devices are associated with humans and demonstrate social related communication characteristics. As most of recent works use real social trace files to study the date forwarding in HDTNs, the privacy protection becomes a serious issue. Traditional privacy protections need to keep the attributes semantics, such as data mining and information retrieval. However, in HDTNs, it is not necessary to keep these meaningful semantics. In this paper, instead, we propose to anonymize the original data by coding to preserve individual's privacy and apply Privacy Protected Data Forwarding (PPDF) model to select the top N nodes to perform the multicast. We use both MIT Reality and Infocom 06 datasets, which are human associated mobile network trace file, to simulate our model. The results of our simulations show that this method can achieve a high data forwarding performance while protect the nodes' privacy as well.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Technology in the manufacturing sector has seen rapid change, transforming from stand alone, manual processes to smart, integrated systems. We have witnessed the migration of relay-based systems to advance SCADA systems, manual pro-duction to fully automated, and hand written reports to interactive computer-based dashboards. We are now seeing the emergence of smart products manufactured in smart plants and the evolution of smart services in manufacturing. Future manu-facturing systems will be distinguished by intelligent machines, automation and human factors’ integration. This talk will focus on how knowledge can be embed-ded in processes and products through the use of simulation and modelling tools to streamline future smart production systems and improve product quality. The implications to future smart manufacturing enterprises are explored through a se-ries of case studies from aerospace, mining and small and medium manufacturing enterprises.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Factories of the Future will be distinguished by intelligent machines, automation, human factors integration and knowledge management. Modelling and simulation is recognised as a key enabling technology essential to economic, social and environmental sustainability of future manufacturing systems. This talk will explore the history, recent achievements and directions in modelling and simulation for 21st century factories and supply chains. A systems science approach is employed, from stakeholder engagement through participative modelling to self-tuning and self-assembling simulations. Our contributions lower the cost of the application of modelling and simulation to manufacturing processes, enabling real time planning, dynamic risk analysis, dashboards and 3D visualisation. This realisation of the virtual factory integrates human factors and decisions into the core technology platform. The implications to future manufacturing enterprises are explored through a series of case studies from aerospace, mining and small and medium manufacturing enterprises.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper, we study two tightly coupled issues: space-crossing community detection and its influence on data forwarding in Mobile Social Networks (MSNs) by taking the hybrid underlying networks with infrastructure support into consideration. The hybrid underlying network is composed of large numbers of mobile users and a small portion of Access Points (APs). Because APs can facilitate the communication among long-distance nodes, the concept of physical proximity community can be extended to be one across the geographical space. In this work, we first investigate a space-crossing community detection method for MSNs. Based on the detection results, we design a novel data forwarding algorithm SAAS (Social Attraction and AP Spreading), and show how to exploit the space-crossing communities to improve the data forwarding efficiency. We evaluate our SAAS algorithm on real-life data from MIT Reality Mining and University of Illinois Movement (UIM). Results show that space-crossing community plays a positive role in data forwarding in MSNs in terms of delivery ratio and delay. Based on this new type of community, SAAS achieves a better performance than existing social community-based data forwarding algorithms in practice, including Bubble Rap and Nguyen's Routing algorithms. © 2014 IEEE.