47 resultados para Exploração de dados (Computação)

em Universidade Federal do Rio Grande do Norte(UFRN)


Relevância:

90.00% 90.00%

Publicador:

Resumo:

Educational Data Mining is an application domain in artificial intelligence area that has been extensively explored nowadays. Technological advances and in particular, the increasing use of virtual learning environments have allowed the generation of considerable amounts of data to be investigated. Among the activities to be treated in this context exists the prediction of school performance of the students, which can be accomplished through the use of machine learning techniques. Such techniques may be used for student’s classification in predefined labels. One of the strategies to apply these techniques consists in their combination to design multi-classifier systems, which efficiency can be proven by results achieved in other studies conducted in several areas, such as medicine, commerce and biometrics. The data used in the experiments were obtained from the interactions between students in one of the most used virtual learning environments called Moodle. In this context, this paper presents the results of several experiments that include the use of specific multi-classifier systems systems, called ensembles, aiming to reach better results in school performance prediction that is, searching for highest accuracy percentage in the student’s classification. Therefore, this paper presents a significant exploration of educational data and it shows analyzes of relevant results about these experiments.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Alimentation is essential in life. Concerning omnivores, characterized by the necessity of a varied diet to satisfy their metabolic needs, it is extremely advantageous the assumption of new foods. However, the assumption of new unknown foods is, potentially dangerous, because of possible intoxications. In this sense, one of the most important behaviors related to reducing risks is the so called food neophobia, characterized by the rejection of new foods and/or an ingestion of very little amounts. The aim of the present study was to investigate if age, sex and socio-economical status were able to influence food neophobia. The neophobia has been described in a range of 3-6 years old children taken both from public and private schools within the city of Natal-RN. Four different type of ice-creams, each one characterized by a different flavor, have been utilized. Two flavors were known to the young and the remaining two flavor were new. We didn't find significant differences between the investigated variables. However, the exploitation of data from the survey conducted showed that the ease or not to accept new foods obtained, was correlated with the variables under the same guidelines observed in literature. Aspects related to the stimulus used probably eased the neophobic answer. Then, it is suggested that the food neophobia can be influenced by sex, age and socioeconomic factors of individuaIs. Neophobia tends to be more common in girls, with ages between three to four years old and with a low leveI socioeconomic. In this sense, given the importance of kid neophobic reaction to the development of dietary patterns of other life's stages, it is necessary to make further studies to better explain this phenomenon. Given the pivotal role of food neophobia to the development of alimentary habits within all ages of life, other studies will be necessary for a better comprehension of such phenomena. Key-words: food neophobia; Evolutionary Psychology;children food intake; diet restriction; children's diet development

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The increase of applications complexity has demanded hardware even more flexible and able to achieve higher performance. Traditional hardware solutions have not been successful in providing these applications constraints. General purpose processors have inherent flexibility, since they perform several tasks, however, they can not reach high performance when compared to application-specific devices. Moreover, since application-specific devices perform only few tasks, they achieve high performance, although they have less flexibility. Reconfigurable architectures emerged as an alternative to traditional approaches and have become an area of rising interest over the last decades. The purpose of this new paradigm is to modify the device s behavior according to the application. Thus, it is possible to balance flexibility and performance and also to attend the applications constraints. This work presents the design and implementation of a coarse grained hybrid reconfigurable architecture to stream-based applications. The architecture, named RoSA, consists of a reconfigurable logic attached to a processor. Its goal is to exploit the instruction level parallelism from intensive data-flow applications to accelerate the application s execution on the reconfigurable logic. The instruction level parallelism extraction is done at compile time, thus, this work also presents an optimization phase to the RoSA architecture to be included in the GCC compiler. To design the architecture, this work also presents a methodology based on hardware reuse of datapaths, named RoSE. RoSE aims to visualize the reconfigurable units through reusability levels, which provides area saving and datapath simplification. The architecture presented was implemented in hardware description language (VHDL). It was validated through simulations and prototyping. To characterize performance analysis some benchmarks were used and they demonstrated a speedup of 11x on the execution of some applications

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The increasing complexity of integrated circuits has boosted the development of communications architectures like Networks-on-Chip (NoCs), as an architecture; alternative for interconnection of Systems-on-Chip (SoC). Networks-on-Chip complain for component reuse, parallelism and scalability, enhancing reusability in projects of dedicated applications. In the literature, lots of proposals have been made, suggesting different configurations for networks-on-chip architectures. Among all networks-on-chip considered, the architecture of IPNoSys is a non conventional one, since it allows the execution of operations, while the communication process is performed. This study aims to evaluate the execution of data-flow based applications on IPNoSys, focusing on their adaptation against the design constraints. Data-flow based applications are characterized by the flowing of continuous stream of data, on which operations are executed. We expect that these type of applications can be improved when running on IPNoSys, because they have a programming model similar to the execution model of this network. By observing the behavior of these applications when running on IPNoSys, were performed changes in the execution model of the network IPNoSys, allowing the implementation of an instruction level parallelism. For these purposes, analysis of the implementations of dataflow applications were performed and compared

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In the oil prospection research seismic data are usually irregular and sparsely sampled along the spatial coordinates due to obstacles in placement of geophones. Fourier methods provide a way to make the regularization of seismic data which are efficient if the input data is sampled on a regular grid. However, when these methods are applied to a set of irregularly sampled data, the orthogonality among the Fourier components is broken and the energy of a Fourier component may "leak" to other components, a phenomenon called "spectral leakage". The objective of this research is to study the spectral representation of irregularly sampled data method. In particular, it will be presented the basic structure of representation of the NDFT (nonuniform discrete Fourier transform), study their properties and demonstrate its potential in the processing of the seismic signal. In this way we study the FFT (fast Fourier transform) and the NFFT (nonuniform fast Fourier transform) which rapidly calculate the DFT (discrete Fourier transform) and NDFT. We compare the recovery of the signal using the FFT, DFT and NFFT. We approach the interpolation of seismic trace using the ALFT (antileakage Fourier transform) to overcome the problem of spectral leakage caused by uneven sampling. Applications to synthetic and real data showed that ALFT method works well on complex geology seismic data and suffers little with irregular spatial sampling of the data and edge effects, in addition it is robust and stable with noisy data. However, it is not as efficient as the FFT and its reconstruction is not as good in the case of irregular filling with large holes in the acquisition.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In the oil prospection research seismic data are usually irregular and sparsely sampled along the spatial coordinates due to obstacles in placement of geophones. Fourier methods provide a way to make the regularization of seismic data which are efficient if the input data is sampled on a regular grid. However, when these methods are applied to a set of irregularly sampled data, the orthogonality among the Fourier components is broken and the energy of a Fourier component may "leak" to other components, a phenomenon called "spectral leakage". The objective of this research is to study the spectral representation of irregularly sampled data method. In particular, it will be presented the basic structure of representation of the NDFT (nonuniform discrete Fourier transform), study their properties and demonstrate its potential in the processing of the seismic signal. In this way we study the FFT (fast Fourier transform) and the NFFT (nonuniform fast Fourier transform) which rapidly calculate the DFT (discrete Fourier transform) and NDFT. We compare the recovery of the signal using the FFT, DFT and NFFT. We approach the interpolation of seismic trace using the ALFT (antileakage Fourier transform) to overcome the problem of spectral leakage caused by uneven sampling. Applications to synthetic and real data showed that ALFT method works well on complex geology seismic data and suffers little with irregular spatial sampling of the data and edge effects, in addition it is robust and stable with noisy data. However, it is not as efficient as the FFT and its reconstruction is not as good in the case of irregular filling with large holes in the acquisition.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Since centuries ago, the Asians use seaweed as an important source of feeding and are their greatest world-wide consumers. The migration of these peoples for other countries, made the demand for seaweed to increase. This increasing demand prompted an industry with annual values of around US$ 6 billion. The algal biomass used for the industry is collected in natural reservoirs or cultivated. The market necessity for products of the seaweed base promotes an unsustainable exploration of the natural banks, compromising its associated biological balance. In this context, seaweed culture appears as a viable alternative to prevent the depletion of these natural supplies. Geographic Information Systems (GIS) provide space and produce information that can facilitate the evaluation of important physical and socio-economic characteristics for the planning of seaweed culture. This objective of this study is to identify potential coastal areas for seaweed culture in the state of Rio Grande do Norte, from the integration of social-environmental data in the SIG. In order to achieve this objective, a geo-referred database composed of geographical maps, nautical maps and orbital digital images was assembled; and a bank of attributes including physical and oceanographical variables (winds, chains, bathymetry, operational distance from the culture) and social and environmental factors (main income, experience with seaweed harvesting, demographic density, proximity of the sheltered coast and distance of the banks) was produced. In the modeling of the data, the integration of the space database with the bank of attributes for the attainment of the map of potentiality of seaweed culture was carried out. Of a total of 2,011 ha analyzed by the GIS for the culture of seaweed, around 34% or 682 ha were indicated as high potential, 55% or 1,101 ha as medium potential, and 11% or 228 ha as low potential. The good indices of potentiality obtained in the localities studied demonstrate that there are adequate conditions for the installation of seaweed culture in the state of Rio Grande do Norte

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Produced water is characterized as one of the most common wastes generated during exploration and production of oil. This work aims to develop methodologies based on comparative statistical processes of hydrogeochemical analysis of production zones in order to minimize types of high-cost interventions to perform identification test fluids - TIF. For the study, 27 samples were collected from five different production zones were measured a total of 50 chemical species. After the chemical analysis was applied the statistical data, using the R Statistical Software, version 2.11.1. Statistical analysis was performed in three steps. In the first stage, the objective was to investigate the behavior of chemical species under study in each area of production through the descriptive graphical analysis. The second step was to identify a function that classify production zones from each sample, using discriminant analysis. In the training stage, the rate of correct classification function of discriminant analysis was 85.19%. The next stage of processing of the data used for Principal Component Analysis, by reducing the number of variables obtained from the linear combination of chemical species, try to improve the discriminant function obtained in the second stage and increase the discrimination power of the data, but the result was not satisfactory. In Profile Analysis curves were obtained for each production area, based on the characteristics of the chemical species present in each zone. With this study it was possible to develop a method using hydrochemistry and statistical analysis that can be used to distinguish the water produced in mature fields of oil, so that it is possible to identify the zone of production that is contributing to the excessive elevation of the water volume.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A legislação ambiental e os principais agentes que se relacionam com a empresa se constituem em fatores exógenos que não podem ser negligenciados ao formular-se e avaliar-se a política ambiental corporativa. As influências exógenas e seus efeitos sobre a gestão ambiental e o gerenciamento de projetos de exploração e produção (E&P) e, por essa via, sobre o desempenho ambiental, foram objetos de estudo desta tese. Embora o desempenho ambiental seja um assunto relevante, a pesquisa sobre esse tema ainda é escassa. Tal carência desponta ainda mais acentuada quando se aborda o desempenho ambiental de projetos na indústria de petróleo e gás. O principal objetivo deste estudo foi avaliar a relação entre a legislação ambiental vigente, as ações de órgãos reguladores, fornecedores, empresas terceirizadas e comunidades locais e o desempenho ambiental dos projetos de E&P na indústria de petróleo e gás e, também, analisar os efeitos do sistema de gestão ambiental e o gerenciamento dos projetos sobre tal desempenho. Na fase abdutiva, foi conduzido um estudo de caso com abordagem qualitativa em uma grande empresa brasileira do setor de petróleo e gás, na fase dedutiva, foi realizada uma pesquisa survey explanatória de corte transversal com abordagem quantitativa, incluindo 113 projetos de E&P de cinco unidades executoras da empresa. Foi formulado um modelo conceitual, com cinco construtos e sete hipóteses de pesquisa, representativo dos efeitos de fatores externos sobre o desempenho ambiental dos projetos de E&P. Os dados foram tratados aplicando a Análise Fatorial Exploratória e a Modelagem de Equações Estruturais com aplicação dos softwares IBM® SPSS® Statistics 20.0 e IBM® SPSS® Amos 18.0. O modelo de equações estruturais foi reespecificado e estimado utilizando o método de Máxima Verossimilhança e o procedimento bootstrap com 2000 reamostragens, até alcançar adequados valores dos índices de ajustamento. O modelo mostrou boa aderência às evidências empíricas, representando uma teoria explicativa dos fatores que influenciam o desempenho ambiental dos projetos de E&P na empresa estudada. As estatísticas descritivas apontaram adequado desempenho dos projetos de E&P com relação aos efluentes descartados, volume de água reutilizada, redução de resíduos e práticas de reciclagem. Identificou-se que projetos de maior porte alcançam melhor desempenho ambiental em relação aos de menor tamanho. Não foram achadas diferenças significativas entre os desempenhos de projetos executados por unidades operacionais distintas. Os resultados da modelagem indicaram que nem a legislação ambiental, nem os agentes externos exercem influência significativa sobre a sistemática da gestão dos projetos de E&P. Os agentes externos atuam sobre a gestão ambiental da empresa exercitando capacidades colaborativas, obstrutivas e propositivas. A legislação ambiental é percebida como entrave ao desenvolvimento dos projetos ao longo de seu ciclo de vida, principalmente, pelas deficiências dos órgãos ambientais. Identificou-se que o sistema de gestão ambiental influencia diretamente o Programa de Desenvolvimento e Execução de Projetos de E&P, que, por sua vez, provoca efeitos diretos e indiretos sobre o desempenho ambiental. Finalmente, comprovou-se que o Sistema de Gestão Ambiental da empresa é determinante para o desempenho ambiental dos projetos de E&P, tanto pelos seus efeitos diretos, como pelos indiretos, estes últimos mediados pela sistemática de gestão dos projetos de E&P

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The progresses of the Internet and telecommunications have been changing the concepts of Information Technology IT, especially with regard to outsourcing services, where organizations seek cost-cutting and a better focus on the business. Along with the development of that outsourcing, a new model named Cloud Computing (CC) evolved. It proposes to migrate to the Internet both data processing and information storing. Among the key points of Cloud Computing are included cost-cutting, benefits, risks and the IT paradigms changes. Nonetheless, the adoption of that model brings forth some difficulties to decision-making, by IT managers, mainly with regard to which solutions may go to the cloud, and which service providers are more appropriate to the Organization s reality. The research has as its overall aim to apply the AHP Method (Analytic Hierarchic Process) to decision-making in Cloud Computing. There to, the utilized methodology was the exploratory kind and a study of case applied to a nationwide organization (Federation of Industries of RN). The data collection was performed through two structured questionnaires answered electronically by IT technicians, and the company s Board of Directors. The analysis of the data was carried out in a qualitative and comparative way, and we utilized the software to AHP method called Web-Hipre. The results we obtained found the importance of applying the AHP method in decision-making towards the adoption of Cloud Computing, mainly because on the occasion the research was carried out the studied company already showed interest and necessity in adopting CC, considering the internal problems with infrastructure and availability of information that the company faces nowadays. The organization sought to adopt CC, however, it had doubt regarding the cloud model and which service provider would better meet their real necessities. The application of the AHP, then, worked as a guiding tool to the choice of the best alternative, which points out the Hybrid Cloud as the ideal choice to start off in Cloud Computing. Considering the following aspects: the layer of Infrastructure as a Service IaaS (Processing and Storage) must stay partly on the Public Cloud and partly in the Private Cloud; the layer of Platform as a Service PaaS (Software Developing and Testing) had preference for the Private Cloud, and the layer of Software as a Service - SaaS (Emails/Applications) divided into emails to the Public Cloud and applications to the Private Cloud. The research also identified the important factors to hiring a Cloud Computing provider

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The objective of this thesis is proposes a method for a mobile robot to build a hybrid map of an indoor, semi-structured environment. The topological part of this map deals with spatial relationships among rooms and corridors. It is a topology-based map, where the edges of the graph are rooms or corridors, and each link between two distinct edges represents a door. The metric part of the map consists in a set of parameters. These parameters describe a geometric figure which adapts to the free space of the local environment. This figure is calculated by a set of points which sample the boundaries of the local free space. These points are obtained with range sensors and with knowledge about the robot s pose. A method based on generalized Hough transform is applied to this set of points in order to obtain the geomtric figure. The building of the hybrid map is an incremental procedure. It is accomplished while the robot explores the environment. Each room is associated with a metric local map and, consequently, with an edge of the topo-logical map. During the mapping procedure, the robot may use recent metric information of the environment to improve its global or relative pose

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Techniques of optimization known as metaheuristics have achieved success in the resolution of many problems classified as NP-Hard. These methods use non deterministic approaches that reach very good solutions which, however, don t guarantee the determination of the global optimum. Beyond the inherent difficulties related to the complexity that characterizes the optimization problems, the metaheuristics still face the dilemma of xploration/exploitation, which consists of choosing between a greedy search and a wider exploration of the solution space. A way to guide such algorithms during the searching of better solutions is supplying them with more knowledge of the problem through the use of a intelligent agent, able to recognize promising regions and also identify when they should diversify the direction of the search. This way, this work proposes the use of Reinforcement Learning technique - Q-learning Algorithm - as exploration/exploitation strategy for the metaheuristics GRASP (Greedy Randomized Adaptive Search Procedure) and Genetic Algorithm. The GRASP metaheuristic uses Q-learning instead of the traditional greedy-random algorithm in the construction phase. This replacement has the purpose of improving the quality of the initial solutions that are used in the local search phase of the GRASP, and also provides for the metaheuristic an adaptive memory mechanism that allows the reuse of good previous decisions and also avoids the repetition of bad decisions. In the Genetic Algorithm, the Q-learning algorithm was used to generate an initial population of high fitness, and after a determined number of generations, where the rate of diversity of the population is less than a certain limit L, it also was applied to supply one of the parents to be used in the genetic crossover operator. Another significant change in the hybrid genetic algorithm is the proposal of a mutually interactive cooperation process between the genetic operators and the Q-learning algorithm. In this interactive/cooperative process, the Q-learning algorithm receives an additional update in the matrix of Q-values based on the current best solution of the Genetic Algorithm. The computational experiments presented in this thesis compares the results obtained with the implementation of traditional versions of GRASP metaheuristic and Genetic Algorithm, with those obtained using the proposed hybrid methods. Both algorithms had been applied successfully to the symmetrical Traveling Salesman Problem, which was modeled as a Markov decision process

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In the recovering process of oil, rock heterogeneity has a huge impact on how fluids move in the field, defining how much oil can be recovered. In order to study this variability, percolation theory, which describes phenomena involving geometry and connectivity are the bases, is a very useful model. Result of percolation is tridimensional data and have no physical meaning until visualized in form of images or animations. Although a lot of powerful and sophisticated visualization tools have been developed, they focus on generation of planar 2D images. In order to interpret data as they would be in the real world, virtual reality techniques using stereo images could be used. In this work we propose an interactive and helpful tool, named ZSweepVR, based on virtual reality techniques that allows a better comprehension of volumetric data generated by simulation of dynamic percolation. The developed system has the ability to render images using two different techniques: surface rendering and volume rendering. Surface rendering is accomplished by OpenGL directives and volume rendering is accomplished by the Zsweep direct volume rendering engine. In the case of volumetric rendering, we implemented an algorithm to generate stereo images. We also propose enhancements in the original percolation algorithm in order to get a better performance. We applied our developed tools to a mature field database, obtaining satisfactory results. The use of stereoscopic and volumetric images brought valuable contributions for the interpretation and clustering formation analysis in percolation, what certainly could lead to better decisions about the exploration and recovery process in oil fields

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This graduate thesis proposes a model to asynchronously replicate heterogeneous databases. This model singularly combines -in a systematic way and in a single project -different concepts, techniques and paradigms related to the areas of database replication and management of heterogeneous databases. One of the main advantages of the replication is to allow applications to continue to process information, during time intervals when they are off the network and to trigger the database synchronization, as soon as the network connection is reestablished. Therefore, the model introduces a communication and update protocol that takes in consideration the environment of asynchronous characteristics used. As part of the work, a tool was developed in Java language, based on the model s premises in order to process, test, simulate and validate the proposed model

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In recent decades, changes have been occurring in the telecommunications industry, allied to competition driven by the policies of privatization and concessions, have fomented the world market irrefutably causing the emergence of a new reality. The reflections in Brazil have become evident due to the appearance of significant growth rates, getting in 2012 to provide a net operating income of 128 billion dollars, placing the country among the five major powers in the world in mobile communications. In this context, an issue of increasing importance to the financial health of companies is their ability to retain their customers, as well as turn them into loyal customers. The appearance of infidelity from customer operators has been generating monthly rates shutdowns about two to four percent per month accounting for business management one of its biggest challenges, since capturing a new customer has meant an expenditure greater than five times to retention. For this purpose, models have been developed by means of structural equation modeling to identify the relationships between the various determinants of customer loyalty in the context of services. The original contribution of this thesis is to develop a model for loyalty from the identification of relationships between determinants of satisfaction (latent variables) and the inclusion of attributes that determine the perceptions of service quality for the mobile communications industry, such as quality, satisfaction, value, trust, expectation and loyalty. It is a qualitative research which will be conducted with customers of operators through simple random sampling technique, using structured questionnaires. As a result, the proposed model and statistical evaluations should enable operators to conclude that customer loyalty is directly influenced by technical and operational quality of the services offered, as well as provide a satisfaction index for the mobile communication segment