951 resultados para Knowledge Discovery
Resumo:
This paper studies Knowledge Discovery (KD) using Tabu Search and Hill Climbing within Case-Based Reasoning (CBR) as a hyper-heuristic method for course timetabling problems. The aim of the hyper-heuristic is to choose the best heuristic(s) for given timetabling problems according to the knowledge stored in the case base. KD in CBR is a 2-stage iterative process on both case representation and the case base. Experimental results are analysed and related research issues for future work are discussed.
Resumo:
This paper presents a case-based heuristic selection approach for automated university course and exam timetabling. The method described in this paper is motivated by the goal of developing timetabling systems that are fundamentally more general than the current state of the art. Heuristics that worked well in previous similar situations are memorized in a case base and are retrieved for solving the problem in hand. Knowledge discovery techniques are employed in two distinct scenarios. Firstly, we model the problem and the problem solving situations along with specific heuristics for those problems. Secondly, we refine the case base and discard cases which prove to be non-useful in solving new problems. Experimental results are presented and analyzed. It is shown that case based reasoning can act effectively as an intelligent approach to learn which heuristics work well for particular timetabling situations. We conclude by outlining and discussing potential research issues in this critical area of knowledge discovery for different difficult timetabling problems.
Resumo:
This paper presents a case-based heuristic selection approach for automated university course and exam timetabling. The method described in this paper is motivated by the goal of developing timetabling systems that are fundamentally more general than the current state of the art. Heuristics that worked well in previous similar situations are memorized in a case base and are retrieved for solving the problem in hand. Knowledge discovery techniques are employed in two distinct scenarios. Firstly, we model the problem and the problem solving situations along with specific heuristics for those problems. Secondly, we refine the case base and discard cases which prove to be non-useful in solving new problems. Experimental results are presented and analyzed. It is shown that case based reasoning can act effectively as an intelligent approach to learn which heuristics work well for particular timetabling situations. We conclude by outlining and discussing potential research issues in this critical area of knowledge discovery for different difficult timetabling problems.
Resumo:
Um dos principais problemas que estação de Tratamento de Água do Monte Novo tem vindo a apresentar é o aparecimento de teores em manganês na água tratada, que por vezes ultrapassam o valor paramétrico estabelecido no Decreto-Lei 306/07, 27 de Agosto (50 g dm-3). Este trabalho permitiu relacionar resultados de várias determinações analíticas efectuadas no laboratório da empresa Águas do Centro Alentejo e, através deles construir modelos fundamentados em técnicas e Descoberta de Conhecimento em Base de Dados que permitiram responder ao problema identificado. Foi ainda possível estabelecer a época do ano em que é mais provável o aparecimento de teores elevados manganês na água tratada. Além disso, mostrou-se que a tomada de água desempenha um papel relevante no aparecimento deste metal na água tratada. Os modelos desenvolvidos permitiram também estabelecer as condições em que é provável o aparecimento de turvação na cisterna de água tratada. Estas estão relacionadas com o pH, o teor em manganês e o teor em ferro. Foi ainda realçada a importância da correcção do pH na fase final do processo de tratamento. Por um lado, o pH deve ser suficientemente elevado para garantir uma água incrustante e, por outro, deve ser baixo para evitar problemas de turvação na cisterna da água tratada. ABSTRACT; The present study took place in the water treatment plant of Monte Novo. This study aimed for solutions to the problem of high values of manganese concentration in the treated water, in some periods of the year. The present work reports models for manganese concentration and for turbidity using Knowledge Discovery Techniques in Data Bases.
Resumo:
Este Trabajo de Fin de Grado (TFG) se engloba en la línea general Social CRM. Concretamente, está vinculado a un trabajo de investigación llamado “Knowledge discovery in social networks by using a logic-based treatment of implications” desarrollado por P. Cordero, M. Enciso, A. Mora, M. Ojeda-Aciego y C. Rossi en la Universidad de Málaga, en el cual se ofrecen nuevas soluciones para la identificación de influencias de los usuarios en las redes sociales mediante herramientas como el Analisis de Conceptos Formales (FCA). El TFG tiene como objetivo el desarrollo de una aplicación que permita al usuario crear una configuración minimal de usuarios en Twitter a los que seguir para conocer información sobre un número determinado de temas. Para ello, obtendremos información sobre dichos temas mediante la API REST pública que proporciona Twitter y procesaremos los datos mediante algoritmos basados en el Análisis de Conceptos Formales (FCA). Posteriormente, la interpretación de los resultados de dicho análisis nos proporcionará información útil sobre lo expuesto al principio. Así, el trabajo se ha dividido en tres partes fundamentales: 1. Obtención de información (fuentes) 2. Procesamiento de los datos 3. Análisis de resultados El sistema se ha implementado como una aplicación web Java EE 7, utilizando JSF para las interfaces. Para el desarrollo web se han utilizado tecnologías y frameworks como Javascript, JQuery, CSS3, Bootstrap, Twitter4J, etc. Además, se ha seguido una metodología incremental para el desarrollo del proyecto y se ha usado UML como herramienta de modelado. Este proyecto se presenta como un trabajo inicial en el que se expondrán, además del sistema implementado, diversos problemas reales y ejemplos que prueben su funcionamiento y muestren la utilidad práctica del mismo
Resumo:
Analysis of data without labels is commonly subject to scrutiny by unsupervised machine learning techniques. Such techniques provide more meaningful representations, useful for better understanding of a problem at hand, than by looking only at the data itself. Although abundant expert knowledge exists in many areas where unlabelled data is examined, such knowledge is rarely incorporated into automatic analysis. Incorporation of expert knowledge is frequently a matter of combining multiple data sources from disparate hypothetical spaces. In cases where such spaces belong to different data types, this task becomes even more challenging. In this paper we present a novel immune-inspired method that enables the fusion of such disparate types of data for a specific set of problems. We show that our method provides a better visual understanding of one hypothetical space with the help of data from another hypothetical space. We believe that our model has implications for the field of exploratory data analysis and knowledge discovery.
Resumo:
In recent years, a plethora of approaches have been proposed to deal with the increasingly challenging task of multi-output regression. This paper provides a survey on state-of-the-art multi-output regression methods, that are categorized as problem transformation and algorithm adaptation methods. In addition, we present the mostly used performance evaluation measures, publicly available data sets for multi-output regression real-world problems, as well as open-source software frameworks.
Resumo:
Analyzing large-scale gene expression data is a labor-intensive and time-consuming process. To make data analysis easier, we developed a set of pipelines for rapid processing and analysis poplar gene expression data for knowledge discovery. Of all pipelines developed, differentially expressed genes (DEGs) pipeline is the one designed to identify biologically important genes that are differentially expressed in one of multiple time points for conditions. Pathway analysis pipeline was designed to identify the differentially expression metabolic pathways. Protein domain enrichment pipeline can identify the enriched protein domains present in the DEGs. Finally, Gene Ontology (GO) enrichment analysis pipeline was developed to identify the enriched GO terms in the DEGs. Our pipeline tools can analyze both microarray gene data and high-throughput gene data. These two types of data are obtained by two different technologies. A microarray technology is to measure gene expression levels via microarray chips, a collection of microscopic DNA spots attached to a solid (glass) surface, whereas high throughput sequencing, also called as the next-generation sequencing, is a new technology to measure gene expression levels by directly sequencing mRNAs, and obtaining each mRNA’s copy numbers in cells or tissues. We also developed a web portal (http://sys.bio.mtu.edu/) to make all pipelines available to public to facilitate users to analyze their gene expression data. In addition to the analyses mentioned above, it can also perform GO hierarchy analysis, i.e. construct GO trees using a list of GO terms as an input.
Resumo:
A problemática relacionada com a modelação da qualidade da água de albufeiras pode ser abordada de diversos pontos de vista. Neste trabalho recorre-se a metodologias de resolução de problemas que emanam da Área Cientifica da Inteligência Artificial, assim como a ferramentas utilizadas na procura de soluções como as Árvores de Decisão, as Redes Neuronais Artificiais e a Aproximação de Vizinhanças. Actualmente os métodos de avaliação da qualidade da água são muito restritivos já que não permitem aferir a qualidade da água em tempo real. O desenvolvimento de modelos de previsão baseados em técnicas de Descoberta de Conhecimento em Bases de Dados, mostrou ser uma alternativa tendo em vista um comportamento pró-activo que pode contribuir decisivamente para diagnosticar, preservar e requalificar as albufeiras. No decurso do trabalho, foi utilizada a aprendizagem não-supervisionada tendo em vista estudar a dinâmica das albufeiras sendo descritos dois comportamentos distintos, relacionados com a época do ano. ABSTRACT: The problems related to the modelling of water quality in reservoirs can be approached from different viewpoints. This work resorts to methods of resolving problems emanating from the Scientific Area of Artificial lntelligence as well as to tools used in the search for solutions such as Decision Trees, Artificial Neural Networks and Nearest-Neighbour Method. Currently, the methods for assessing water quality are very restrictive because they do not indicate the water quality in real time. The development of forecasting models, based on techniques of Knowledge Discovery in Databases, shows to be an alternative in view of a pro-active behavior that may contribute to diagnose, maintain and requalify the water bodies. ln this work. unsupervised learning was used to study the dynamics of reservoirs, being described two distinct behaviors, related to the time of year.
Resumo:
Dissertation presented to obtain the Ph.D degree in Bioinformatics
Resumo:
Three different periods may be considered in the evolution of knowledge about the clinical and epidemiological aspects of Chagas disease since its discovery: (a) early period concerning the studies carried out by Carlos Chagas in Lassance with the collaboration of other investigators of the Manguinhos School. At that time the disease was described and the parasite, transmitters and reservoirs were studied. The coexistence of endemic goiter in the same region generated some confusion about the clinical forms of the disease; (b) second period involving uncertainty and the description of isolated cases, which lasted until the 1940 decade. Many acute cases were described during this period and the disease was recognized in many Latin American countries. Particularly important were the studies of the Argentine Mission of Regional Pathology Studies, which culminated with the description of the Romaña sign in the 1930 decade, facilitating the diagnosis of the early phase of the disease. However, the chronic phase, which was the most important, continued to be difficult to recognize; (c) period of consolidation of knowledge and recognition of the importance of Chagas disease. Studies conducted by Laranja, Dias and Nóbrega in Bambuí updated the description of Chagas heart disease made by Carlos Chagas and Eurico Villela. From then on, the disease was more easily recognized, especially with the emphasis on the use of a serologic diagnosis; (d) period of enlargement of knowledges on the disease. The studies on denervation conducted in Ribeirão Preto by Fritz Köberle starting in the 1950 decade led to a better understanding of the relations between Chagas disease and megaesophagus and other visceral megas detected in endemic areas.
Resumo:
Mode of access: Internet.
Resumo:
v.1. From creation to the patriarchs.--v.2. From Moses to the judges.--v.3. From Samson to Solomon.--v.4. Rehoboam to Hezekiah.--v.5. From Menasseth to Zedekia and contemporary prophets.--v.6. From the exile to Malachi.
Resumo:
Mode of access: Internet.
Resumo:
Editors: Jan. 1920-June 1921, A.S. Russell.---July 1921-Dec. 1923, Edward Liveing.---Jan.-Apr. 1924, R.J.V. Pulvertaft.---May 1924-Mar. 1926, H.B.C. Pollard.---Apr. 1926-1931, J.A. Benn.---1932-34, Bernard Lintern.---1934-Mar. 1938, L.R. Muirhead.---Apr. 1938-1940, C.P. Snow.