63 resultados para Query clustering
Resumo:
This master’s thesis addresses the maintenance of pre-computed structures, which store a frequent or expensive query, for the nested bag data type in the high level work-flow language Pig Latin. This thesis defines a model suitable to accommodate incremental expressions over nested bags on Pig Latin. Afterwards, the partitioned normal form for sets is extended with further restrictions, in order to accommodate the nested bag model, allow the Pig Latin nest and unnest operators revert each other, and create a suitable environment to the incremental computations. Subsequently, the extended operators – extended union and extended difference – are defined for the nested bag data model with the partitioned normal form for bags (PNF Bag) restriction, and semantics for the extended operators are given. Finally, incremental data propagation expressions are proposed for the nest and unnest operators on the data model proposed with the PNF Bag restriction, and the proof of correctness is given.
Resumo:
A monitorização da qualidade da energia eléctrica tem revelado importância crescente na gestão e caracterização da rede eléctrica. Estudos revelam que os custos directos relacionados com perda de qualidade da energia eléctrica podem representar cerca de 1,5 % do PIB nacional. Para além destes, tem-se adicionalmente os custos indirectos o que se traduz num problema que necessita de minimização. No contexto da minimização dos danos causados pela degradação de energia, são utilizados equipamentos com capacidade de caracterizar a energia eléctrica através da sua monitorização. A utilização destes equipamentos têm subjacente normas de qualidade de energia, que impõem requisitos mínimos de modo a enquadrar e classificar eventos ocorridos na rede eléctrica. Deste modo obtêm-se dados coerentes provenientes de diferentes equipamentos. A monitorização dos parâmetros associados à energia eléctrica é frequentemente realizada através da instalação temporária dos esquipamentos na rede eléctrica, o que resulta numa observação de distúrbios a posteriori da sua ocasião. Esta metodologia não permite detectar o evento eléctrico original mas, quando muito, outros que se espera que sejam semelhantes ao ocorrido. Repare-se, no entanto, que existe um conjunto alargado de eventos que não são repetitivos, constituindo assim uma limitação aquela metodologia. Este trabalho descreve uma alternativa à metodologia de utilização tradicional dos equipamentos. A solução consiste em realizar um analisador de energia que faça parte integrante da instalação e permita a monitorização contínua da rede eléctrica. Este equipamento deve ter um custo suficientemente baixo para que seja justificável nesta utilização alternativa. O analisador de qualidade de energia a desenvolver tem por base o circuito integrado ADE7880, que permite obter um conjunto de parâmetros da qualidade de energia eléctrica de acordo com as normas de energia IEC 61000-4-30 e IEC 61000-4-7. Este analisador permite a recolha contínua de dados específicos da rede eléctrica, e que posteriormente serão armazenados e colocados à disposição do utilizador. Deste modo os dados recolhidos serão apresentados ao utilizador para consulta, de maneira a verificar, de modo continuo a eventual ocorrência das anomalias na rede. Os valores adquiridos podem ainda ser reutilizados vantajosamente para muitas outras finalidades tais como efectuar estudos sobre a optimização energética. O trabalho presentemente desenvolvido decorre de uma utilização alternativa do dispositivo WeSense Energy1 desenvolvido pela equipa da Evoleo Technologies. A presente vertente permite obter parâmetros determinados pelo ADE7880 tais como por exemplo harmónicos, eventos transitórios de tensão e corrente e o desfasamento entre fases, realizando assim uma nova versão do dispositivo, o WeSense Energy2. Adicionalmente este trabalho inclui a visualização remota dos através de uma página web.
Resumo:
This paper analyses forest fires in the perspective of dynamical systems. Forest fires exhibit complex correlations in size, space and time, revealing features often present in complex systems, such as the absence of a characteristic length-scale, or the emergence of long range correlations and persistent memory. This study addresses a public domain forest fires catalogue, containing information of events for Portugal, during the period from 1980 up to 2012. The data is analysed in an annual basis, modelling the occurrences as sequences of Dirac impulses with amplitude proportional to the burnt area. First, we consider mutual information to correlate annual patterns. We use visualization trees, generated by hierarchical clustering algorithms, in order to compare and to extract relationships among the data. Second, we adopt the Multidimensional Scaling (MDS) visualization tool. MDS generates maps where each object corresponds to a point. Objects that are perceived to be similar to each other are placed on the map forming clusters. The results are analysed in order to extract relationships among the data and to identify forest fire patterns.
Resumo:
In this paper we analyze the behavior of tornado time-series in the U.S. from the perspective of dynamical systems. A tornado is a violently rotating column of air extending from a cumulonimbus cloud down to the ground. Such phenomena reveal features that are well described by power law functions and unveil characteristics found in systems with long range memory effects. Tornado time series are viewed as the output of a complex system and are interpreted as a manifestation of its dynamics. Tornadoes are modeled as sequences of Dirac impulses with amplitude proportional to the events size. First, a collection of time series involving 64 years is analyzed in the frequency domain by means of the Fourier transform. The amplitude spectra are approximated by power law functions and their parameters are read as an underlying signature of the system dynamics. Second, it is adopted the concept of circular time and the collective behavior of tornadoes analyzed. Clustering techniques are then adopted to identify and visualize the emerging patterns.
Resumo:
In this paper, we apply multidimensional scaling (MDS) and parametric similarity indices (PSI) in the analysis of complex systems (CS). Each CS is viewed as a dynamical system, exhibiting an output time-series to be interpreted as a manifestation of its behavior. We start by adopting a sliding window to sample the original data into several consecutive time periods. Second, we define a given PSI for tracking pieces of data. We then compare the windows for different values of the parameter, and we generate the corresponding MDS maps of ‘points’. Third, we use Procrustes analysis to linearly transform the MDS charts for maximum superposition and to build a global MDS map of “shapes”. This final plot captures the time evolution of the phenomena and is sensitive to the PSI adopted. The generalized correlation, the Minkowski distance and four entropy-based indices are tested. The proposed approach is applied to the Dow Jones Industrial Average stock market index and the Europe Brent Spot Price FOB time-series.
Resumo:
This paper studies forest fires from the perspective of dynamical systems. Burnt area, precipitation and atmospheric temperatures are interpreted as state variables of a complex system and the correlations between them are investigated by means of different mathematical tools. First, we use mutual information to reveal potential relationships in the data. Second, we adopt the state space portrait to characterize the system’s behavior. Third, we compare the annual state space curves and we apply clustering and visualization tools to unveil long-range patterns. We use forest fire data for Portugal, covering the years 1980–2003. The territory is divided into two regions (North and South), characterized by different climates and vegetation. The adopted methodology represents a new viewpoint in the context of forest fires, shedding light on a complex phenomenon that needs to be better understood in order to mitigate its devastating consequences, at both economical and environmental levels.
Resumo:
This paper studies the statistical distributions of worldwide earthquakes from year 1963 up to year 2012. A Cartesian grid, dividing Earth into geographic regions, is considered. Entropy and the Jensen–Shannon divergence are used to analyze and compare real-world data. Hierarchical clustering and multi-dimensional scaling techniques are adopted for data visualization. Entropy-based indices have the advantage of leading to a single parameter expressing the relationships between the seismic data. Classical and generalized (fractional) entropy and Jensen–Shannon divergence are tested. The generalized measures lead to a clear identification of patterns embedded in the data and contribute to better understand earthquake distributions.
Resumo:
Every year forest fires consume large areas, being a major concern in many countries like Australia, United States and Mediterranean Basin European Countries (e.g., Portugal, Spain, Italy and Greece). Understanding patterns of such events, in terms of size and spatiotemporal distributions, may help to take measures beforehand in view of possible hazards and decide strategies of fire prevention, detection and suppression. Traditional statistical tools have been used to study forest fires. Nevertheless, those tools might not be able to capture the main features of fires complex dynamics and to model fire behaviour [1]. Forest fires size-frequency distributions unveil long range correlations and long memory characteristics, which are typical of fractional order systems [2]. Those complex correlations are characterized by self-similarity and absence of characteristic length-scale, meaning that forest fires exhibit power-law (PL) behaviour. Forest fires have also been proved to exhibit time-clustering phenomena, with timescales of the order of few days [3]. In this paper, we study forest fires in the perspective of dynamical systems and fractional calculus (FC). Public domain forest fires catalogues, containing data of events occurred in Portugal, in the period 1980 up to 2011, are considered. The data is analysed in an annual basis, modelling the occurrences as sequences of Dirac impulses. The frequency spectra of such signals are determined using Fourier transforms, and approximated through PL trendlines. The PL parameters are then used to unveil the fractional-order dynamics characteristics of the data. To complement the analysis, correlation indices are used to compare and find possible relationships among the data. It is shown that the used approach can be useful to expose hidden patterns not captured by traditional tools.
Resumo:
Proceeding of the 3rd International Conference on Fractional Systems and Signals, at Ghent, Belgium
Resumo:
A competitividade no fabrico de componentes para a indústria automóvel é um factor-chave para o sucesso de qualquer empresa que queira permanecer neste sector de actividade. Atendendo a que o custo de mão-de-obra tem tendência a subir, e que a qualidade é muito mais difícil de assegurar quando os processos assentam essencialmente em produção manual, a automatização ganha cada vez maior relevo, permitindo uma maior produtividade e repetibilidade, assegurando simultaneamente níveis de qualidade superiores, o que contribui também para um incremento da produtividade ainda mais acentuado. Em Portugal, muitas empresas que trabalham para o sector automóvel já apostam fortemente na automatização de processos, e até na robotização. Esta é a única via para melhorar a competitividade e conseguir concorrer com países onde a mão-de-obra é bastante mais económica, ou com outros onde a automação está fortemente instalada. Este trabalho centrou-se na optimização de um equipamento destinado ao fabrico semiautomático de estruturas de assentamento dos estofos para automóveis. O equipamento original estava já fortemente automatizado, mas necessitava ainda de algumas operações manuais, as quais se resumiam a pouco mais do que transferência e agrupamento de subconjuntos. O trabalho teve que ter em conta todas as limitações impostas pelos sistemas já existentes, e ser realizável com o custo mais económico possível. Depois de vários estudos e propostas, o projecto foi implementado.
Resumo:
Further improvements in demand response programs implementation are needed in order to take full advantage of this resource, namely for the participation in energy and reserve market products, requiring adequate aggregation and remuneration of small size resources. The present paper focuses on SPIDER, a demand response simulation that has been improved in order to simulate demand response, including realistic power system simulation. For illustration of the simulator’s capabilities, the present paper is proposes a methodology focusing on the aggregation of consumers and generators, providing adequate tolls for the demand response program’s adoption by evolved players. The methodology proposed in the present paper focuses on a Virtual Power Player that manages and aggregates the available demand response and distributed generation resources in order to satisfy the required electrical energy demand and reserve. The aggregation of resources is addressed by the use of clustering algorithms, and operation costs for the VPP are minimized. The presented case study is based on a set of 32 consumers and 66 distributed generation units, running on 180 distinct operation scenarios.
Resumo:
In this paper we study several natural and man-made complex phenomena in the perspective of dynamical systems. For each class of phenomena, the system outputs are time-series records obtained in identical conditions. The time-series are viewed as manifestations of the system behavior and are processed for analyzing the system dynamics. First, we use the Fourier transform to process the data and we approximate the amplitude spectra by means of power law functions. We interpret the power law parameters as a phenomenological signature of the system dynamics. Second, we adopt the techniques of non-hierarchical clustering and multidimensional scaling to visualize hidden relationships between the complex phenomena. Third, we propose a vector field based analogy to interpret the patterns unveiled by the PL parameters.
Resumo:
The last 40 years of the world economy are analyzed by means of computer visualization methods. Multidimensional scaling and the hierarchical clustering tree techniques are used. The current Western downturn in favor of Asian partners may still be reversed in the coming decades.
Resumo:
Atmospheric temperatures characterize Earth as a slow dynamics spatiotemporal system, revealing long-memory and complex behavior. Temperature time series of 54 worldwide geographic locations are considered as representative of the Earth weather dynamics. These data are then interpreted as the time evolution of a set of state space variables describing a complex system. The data are analyzed by means of multidimensional scaling (MDS), and the fractional state space portrait (fSSP). A centennial perspective covering the period from 1910 to 2012 allows MDS to identify similarities among different Earth’s locations. The multivariate mutual information is proposed to determine the “optimal” order of the time derivative for the fSSP representation. The fSSP emerges as a valuable alternative for visualizing system dynamics.
Resumo:
A importância da internet é hoje uma realidade cuja facilidade que nos traz de aceder a produtos ouserviços, informação ou até mesmo aproximar pessoas, torna-‐a ainda mais indispensável. Cada vez mais, a nossa vida é feita através internet. Seja uma simples consulta de informação de horário de funcionamento de uma loja, a compra de produtos de uma loja usando plataformas de venda online, transações bancárias ou até operações fiscais, a internet faz parte das nossas vidas. Até novas áreas de negócio surgem com a massificação do uso da internet. Naturalmente, o surgimento de plataformas que repliquem o mundo real no mundo virtual torna‐se bastante óbvio e cada vez mais desejado. A pesquisa de emprego é algo bastante comum no mundo real. Naturalmente, com a internet, surgiram e surgem plataformas dedicadas a esta área. As empresas que disponibilizam empregos recorrem-‐se destas plataformas pois, estão ao alcance de muitos utilizadores e, geralmente, são gratuitas, juntando o melhor de dois mundos. A necessidade de atingir com maior eficácia o publico alvo leva a que surjam plataformas com maior granularidade de áreas de emprego ou então especializadas em determinadas áreas. Contudo, a pesquisa nestas plataformas fica aquém do desejado pois não tem em consideração a relevância de um emprego para o utilizador apresentando resultados irrelevantes. No sentido de oferecer um novo paradigma de pesquisa de empregos, criou-se uma plataforma, dotada de conhecimento, que estende a pesquisa o tipo de pesquisa tradicional obtendo mais resultados com muita relevância para o utilizador.