Abstract:
We prove the existence and uniqueness of the solution of the impedance (or third) boundary value problem for the Helmholtz equation in a half-plane with arbitrary L∞ boundary data. This problem is of interest as a model of outdoor sound propagation over inhomogeneous flat terrain and as a model of rough surface scattering. To formulate the problem and prove uniqueness of the solution we introduce a novel radiation condition, a generalization of that used in plane wave scattering by one-dimensional diffraction gratings. To prove existence of the solution and a limiting absorption principle we first reformulate the problem as an equivalent second kind boundary integral equation, to which we apply a form of the Fredholm alternative, utilizing recent results on the solvability of integral equations on the real line in [5].
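For orientation, a minimal statement of this boundary value problem under the usual conventions for the half-plane, with wavenumber k > 0, surface admittance β ∈ L∞(ℝ) and boundary data g ∈ L∞(ℝ); the exact sign conventions, function-space setting and the new radiation condition are those specified in the paper itself, so this is only a sketch:

```latex
% Impedance (third) boundary value problem in the half-plane (sketch of the
% standard formulation; the paper's exact sign conventions may differ).
\Delta u + k^2 u = 0 \quad \text{in } U = \{(x_1,x_2) : x_2 > 0\},
\qquad
\frac{\partial u}{\partial x_2} + i k \beta\, u = g \quad \text{on } \Gamma = \{(x_1,0) : x_1 \in \mathbb{R}\},
```

together with the radiation condition at infinity introduced in the paper.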
Abstract:
Data assimilation (DA) systems are evolving to meet the demands of convection-permitting models in the field of weather forecasting. On 19 April 2013 a special interest group meeting of the Royal Meteorological Society brought together UK researchers looking at different aspects of the high-resolution data assimilation problem, from theory to applications, as well as researchers creating our future high-resolution observational networks. The meeting was chaired by Dr Sarah Dance of the University of Reading and Dr Cristina Charlton-Perez from the MetOffice@Reading. The purpose of the meeting was to help define the current state of high-resolution data assimilation in the UK. The workshop assembled three main types of scientists: observational network specialists, operational numerical weather prediction researchers, and those developing the fundamental mathematical theory behind data assimilation and the underlying models. These three working areas are intrinsically linked; therefore, a holistic view must be taken when discussing the potential to make advances in high-resolution data assimilation.
Abstract:
Operational forecasting centres are currently developing data assimilation systems for coupled atmosphere-ocean models. Strongly coupled assimilation, in which a single assimilation system is applied to the coupled model, presents significant technical and scientific challenges. Hence weakly coupled assimilation systems are being developed as a first step, in which the coupled model is used to compare the current state estimate with observations, but corrections to the atmosphere and ocean initial conditions are then calculated independently. In this paper we provide a comprehensive description of the different coupled assimilation methodologies in the context of four-dimensional variational assimilation (4D-Var) and use an idealised framework to assess the expected benefits of moving towards coupled data assimilation. We implement an incremental 4D-Var system within an idealised single-column atmosphere-ocean model. The system has the capability to run both strongly and weakly coupled assimilations as well as uncoupled atmosphere-only or ocean-only assimilations, thus allowing a systematic comparison of the different strategies for treating the coupled data assimilation problem. We present results from a series of identical twin experiments devised to investigate the behaviour and sensitivities of the different approaches. Overall, our study demonstrates the potential benefits that may be expected from coupled data assimilation. When compared to uncoupled initialisation, coupled assimilation is able to produce more balanced initial analysis fields, thus reducing initialisation shock and its impact on the subsequent forecast. Single-observation experiments demonstrate how coupled assimilation systems are able to pass information between the atmosphere and ocean and therefore use near-surface data to greater effect. We show that much of this benefit may also be gained from a weakly coupled assimilation system, but that this can be sensitive to the parameters used in the assimilation.
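As background, the generic incremental 4D-Var cost function minimised by such a system has the standard form below; this is the textbook formulation, not a transcription of the authors' specific configuration. In a weakly coupled system the background-error covariance B is block-diagonal across the atmosphere and ocean, so the two increments are computed independently; a strongly coupled system allows atmosphere-ocean cross-covariances and a single joint minimisation.

```latex
% Incremental 4D-Var: minimise over the increment \delta x_0 to the background x_0^b
J(\delta x_0) = \tfrac{1}{2}\,\delta x_0^{\mathrm T}\,\mathbf{B}^{-1}\,\delta x_0
  + \tfrac{1}{2}\sum_{i=0}^{N}
    \bigl(\mathbf{H}_i \mathbf{M}_i\,\delta x_0 - d_i\bigr)^{\mathrm T}
    \mathbf{R}_i^{-1}
    \bigl(\mathbf{H}_i \mathbf{M}_i\,\delta x_0 - d_i\bigr),
\qquad
d_i = y_i - \mathcal{H}_i\!\bigl(\mathcal{M}_i(x_0^b)\bigr),
```

where M_i and H_i are the linearised model and observation operators, R_i the observation-error covariances, and d_i the innovations computed from the observations y_i and the background trajectory.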
Abstract:
An extensive sample (2%) of private vehicles in Italy is equipped with a GPS device that periodically measures their position and dynamical state for insurance purposes. Access to this type of data makes it possible to develop theoretical and practical applications of great interest: the real-time reconstruction of the traffic state in a given region, the development of accurate models of vehicle dynamics, and the study of the cognitive dynamics of drivers. For these applications to be possible, we first need the ability to reconstruct the paths taken by vehicles on the road network from the raw GPS data. These data are affected by positioning errors and consecutive measurements are often far apart (~2 km), so the task of path identification is not straightforward. This thesis describes the approach we followed to reliably identify vehicle paths from this kind of low-sampling-rate data. The problem of matching the data to roads is solved with a Bayesian maximum-likelihood approach, while the identification of the path taken between two consecutive GPS measurements is performed with a purpose-built optimal routing algorithm based on the A* algorithm. The procedure was applied to an off-line urban data sample and proved to be robust and accurate. Future developments will extend the procedure to real-time execution and nation-wide coverage.
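The routing step described in the thesis is a purpose-built algorithm based on A*; the sketch below is only the generic A* shortest-path search on a road graph, with straight-line distance as the admissible heuristic, of the kind used to connect two consecutive GPS fixes. The graph and coordinate structures are hypothetical placeholders, not the thesis' actual data model.

```python
import heapq
import math

def a_star(graph, coords, start, goal):
    """Minimal A* shortest-path search on a road graph.

    graph:  dict node -> list of (neighbour, edge_length_in_metres)
    coords: dict node -> (x, y) planar coordinates in metres
    Returns the node sequence from start to goal, or None if unreachable.
    """
    def h(n):  # admissible heuristic: straight-line distance to the goal
        (x1, y1), (x2, y2) = coords[n], coords[goal]
        return math.hypot(x1 - x2, y1 - y2)

    open_set = [(h(start), 0.0, start, None)]   # entries: (f, g, node, parent)
    came_from, best_g = {}, {start: 0.0}
    while open_set:
        _, g, node, parent = heapq.heappop(open_set)
        if node in came_from:
            continue                            # already expanded with a better cost
        came_from[node] = parent
        if node == goal:                        # reconstruct the path
            path = [node]
            while came_from[path[-1]] is not None:
                path.append(came_from[path[-1]])
            return path[::-1]
        for nbr, length in graph.get(node, []):
            new_g = g + length
            if new_g < best_g.get(nbr, float("inf")):
                best_g[nbr] = new_g
                heapq.heappush(open_set, (new_g + h(nbr), new_g, nbr, node))
    return None
```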
Abstract:
Cloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS) technology. Based on the concepts of “resources-on-demand” and “pay-as-you-go”, scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes significant data transfer latency from the client’s site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, in which the NGS data are processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks in which the NGS sequences can be processed independently of one another. We also provide the elastream package, which supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both the time and the cost of computation.
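As an illustration of the streaming idea (this is not the elastream package's actual API), the sketch below overlaps data transfer with computation using a bounded producer-consumer queue, so analysis of early chunks starts before the whole NGS file has arrived; `process_chunk` is a hypothetical stand-in for any per-chunk analysis step that can run independently on each piece of data.

```python
import queue
import threading

CHUNK_SIZE = 4 * 1024 * 1024  # 4 MiB chunks; tune to network/analysis throughput

def stream_and_process(path, process_chunk):
    """Overlap transfer with analysis: one thread reads (or receives) chunks
    while the main thread consumes them, so computation starts before the
    whole file has arrived."""
    chunks = queue.Queue(maxsize=8)       # bounded buffer = limited memory use

    def producer():
        with open(path, "rb") as f:
            while True:
                chunk = f.read(CHUNK_SIZE)
                if not chunk:
                    break
                chunks.put(chunk)         # blocks if the consumer falls behind
        chunks.put(None)                  # sentinel: end of stream

    threading.Thread(target=producer, daemon=True).start()
    while True:
        chunk = chunks.get()
        if chunk is None:
            break
        process_chunk(chunk)              # per-chunk NGS analysis step
```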
Abstract:
Phase equilibrium data regression is an unavoidable task necessary to obtain appropriate parameter values for any model to be used in separation equipment design for chemical process simulation and optimization. The accuracy of this process depends on different factors, such as the experimental data quality, the selected model and the calculation algorithm. The present paper summarizes the results and conclusions achieved in our research on the capabilities and limitations of the existing GE (excess Gibbs energy) models and on strategies that can be included in the correlation algorithms to improve convergence and avoid inconsistencies. The NRTL model has been selected as a representative local composition model. New capabilities of this model, but also several relevant limitations, have been identified, and some examples of the application of a modified NRTL equation are discussed. Furthermore, a regression algorithm has been developed that allows the advisable simultaneous regression of all the condensed-phase equilibrium regions present in ternary systems at constant T and P. It includes specific strategies designed to avoid some of the pitfalls frequently found in commercial regression tools for phase equilibrium calculations. Most of the proposed strategies are based on the geometrical interpretation of the lowest common tangent plane equilibrium criterion, which allows an unambiguous comprehension of the behavior of the mixtures. The paper aims to present all of this work as a whole in order to reveal the effort that must still be devoted to overcoming the difficulties that remain in the phase equilibrium data regression problem.
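For reference, the standard multicomponent NRTL expression for the activity coefficient, on which the modified equation and the regression strategies discussed here build:

```latex
% NRTL activity coefficient of component i in a multicomponent mixture
\ln\gamma_i =
  \frac{\sum_j x_j \tau_{ji} G_{ji}}{\sum_k x_k G_{ki}}
  + \sum_j \frac{x_j G_{ij}}{\sum_k x_k G_{kj}}
    \left( \tau_{ij} - \frac{\sum_m x_m \tau_{mj} G_{mj}}{\sum_k x_k G_{kj}} \right),
\qquad
G_{ij} = \exp\!\left(-\alpha_{ij}\,\tau_{ij}\right), \quad \tau_{ii} = 0,
```

where the τ_ij are the binary interaction parameters and the α_ij the non-randomness factors adjusted during data regression.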
Abstract:
Obtaining wind vectors over the ocean is important for weather forecasting and ocean modelling. Several satellite systems used operationally by meteorological agencies utilise scatterometers to infer wind vectors over the oceans. In this paper we present the results of using novel neural-network-based techniques to estimate wind vectors from such data. The problem is partitioned into estimating wind speed and wind direction. Wind speed is modelled using a multi-layer perceptron (MLP) and a sum-of-squares error function. Wind direction is a periodic variable and a multi-valued function for a given set of inputs; a conventional MLP fails at this task, so we model the full periodic probability density of direction, conditioned on the satellite-derived inputs, using a Mixture Density Network (MDN) with periodic kernel functions. A committee of the resulting MDNs is shown to improve the results.
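A minimal sketch of the kind of periodic density such an MDN represents: a mixture of von Mises (circular normal) kernels over wind direction. In the actual network the mixture parameters are functions of the scatterometer-derived inputs; here they are simply passed in, so this is illustrative rather than the authors' model.

```python
import numpy as np
from scipy.special import i0  # modified Bessel function of the first kind, order 0

def von_mises_mixture_pdf(theta, weights, means, kappas):
    """Density of a mixture of von Mises kernels at angle theta (radians).
    weights, means, kappas: one entry per mixture component."""
    theta = np.asarray(theta)[..., None]                       # broadcast over kernels
    kernels = np.exp(kappas * np.cos(theta - means)) / (2 * np.pi * i0(kappas))
    return np.sum(weights * kernels, axis=-1)

# Example: a bimodal direction density, of the sort that can represent the
# directional ambiguity typical of scatterometer wind retrievals.
pdf = von_mises_mixture_pdf(
    theta=np.linspace(-np.pi, np.pi, 5),
    weights=np.array([0.5, 0.5]),
    means=np.array([0.0, np.pi]),
    kappas=np.array([4.0, 4.0]),
)
```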
Abstract:
Social media classification problems have drawn more and more attention in the past few years. With the rapid development of the Internet and the popularity of computers, there is an astronomical amount of information in social networks (social media platforms). The datasets are generally large-scale and are often corrupted by noise. The presence of noise in the training set has a strong impact on the performance of supervised learning (classification) techniques. A budget-driven One-class SVM approach is presented in this thesis that is suitable for large-scale social media data classification. Our approach is based on an existing online One-class SVM learning algorithm, referred to as the STOCS (Self-Tuning One-Class SVM) algorithm. To justify our choice, we first analyze the noise resilience of STOCS using synthetic data. The experiments suggest that STOCS is more robust against label noise than several other existing approaches. Next, to handle the big-data classification problem for social media data, we introduce several budget-driven features, which allow the algorithm to be trained within limited time and under a limited memory requirement. Moreover, the resulting algorithm can easily be adapted to changes in dynamic data with minimal computational cost. Compared with two state-of-the-art approaches, Lib-Linear and kNN, our approach is shown to be competitive, with lower memory and time requirements.
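The STOCS algorithm itself is not reproduced here; as a loose illustration of budget-conscious online one-class learning, the sketch below trains scikit-learn's SGDOneClassSVM (assumed available, scikit-learn >= 1.0) incrementally over mini-batches, so only one batch is held in memory at a time and training time scales with the data seen so far.

```python
import numpy as np
from sklearn.linear_model import SGDOneClassSVM

def train_streaming(batches, nu=0.05):
    """Fit a linear one-class SVM incrementally over mini-batches so that
    memory use stays bounded (a stand-in for the budget-driven setting;
    this is not the STOCS algorithm itself)."""
    model = SGDOneClassSVM(nu=nu, random_state=0)
    for X in batches:                 # each X: (batch_size, n_features)
        model.partial_fit(X)
    return model

# Toy usage: batches of 2-D points arrive one at a time; after training,
# decision_function gives an anomaly score, and predict returns -1 for
# points flagged as outliers.
rng = np.random.default_rng(0)
batches = (rng.normal(loc=5.0, size=(200, 2)) for _ in range(10))
model = train_streaming(batches)
scores = model.decision_function(rng.normal(loc=5.0, size=(5, 2)))
```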
Abstract:
Recently, we built a classification model that is capable of assigning a given sesquiterpene lactone (STL) to exactly one tribe of the plant family Asteraceae from which the STL has been isolated. Although many plant species are able to biosynthesize a set of peculiar compounds, the occurrence of the same secondary metabolites in more than one tribe of Asteraceae is frequent. Building on our previous work, in this paper we explore the possibility of assigning an STL to more than one tribe (class) simultaneously. When an object may belong to more than one class simultaneously, it is called multilabeled. In this work, we present a general overview of the techniques available to examine multilabeled data. The problem of evaluating the performance of a multilabeled classifier is discussed. Two particular multilabeled classification methods, cross-training with support vector machines (ct-SVM) and multilabeled k-nearest neighbors (M-L-kNN), were applied to the classification of the STLs into seven tribes from the plant family Asteraceae. The results are compared to a single-label classification and are analyzed from a chemotaxonomic point of view. The multilabeled approach allowed us to (1) model the reality as closely as possible, (2) improve our understanding of the relationship between the secondary metabolite profiles of different Asteraceae tribes, and (3) significantly decrease the number of plant sources to be considered when searching for a certain STL. The presented classification models are useful for the targeted collection of plants with the objective of finding plant sources of natural compounds that are biologically active or possess other specific properties of interest.
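Neither ct-SVM nor M-L-kNN is reproduced here; the sketch below is only a generic binary-relevance baseline (one SVM per tribe) on placeholder data, evaluated with two of the usual multilabel measures, Hamming loss and micro-averaged F1, to make the multilabel setting and its evaluation concrete.

```python
import numpy as np
from sklearn.multiclass import OneVsRestClassifier
from sklearn.svm import SVC
from sklearn.metrics import hamming_loss, f1_score

# X: molecular descriptors of the STLs; Y: binary indicator matrix with one
# column per tribe (an STL may belong to several tribes at once). Both are
# random placeholders here, not the paper's data.
rng = np.random.default_rng(0)
X = rng.normal(size=(120, 20))
Y = (rng.random(size=(120, 7)) < 0.25).astype(int)   # 7 tribes

clf = OneVsRestClassifier(SVC(kernel="rbf", C=1.0))  # one binary SVM per tribe
clf.fit(X[:100], Y[:100])
Y_pred = clf.predict(X[100:])

# Hamming loss = per-label error rate; micro-F1 pools all (sample, label) decisions.
print(hamming_loss(Y[100:], Y_pred))
print(f1_score(Y[100:], Y_pred, average="micro"))
```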
Abstract:
The lines of writing that make up Devir Menor de Alice (Alice's Minor Becoming) arise from the need to investigate the becoming-child of learning bodies in the process of language learning, as a potency capable of engendering, in the immanence of educational encounters, a curriculum that functions as a plane for the constitution of a singular style of inscription of the self and of the world within early childhood education. Driven by forces that unfold into a problematic field guiding an investigative path of a cartographic character, the study seeks to problematize: through which processes can the problem of writing order an expressive movement of language learning? Around this problematic, it conceives three central questions: (1) In what way do the differential language practices traced in the immanent movement of the curriculum positively displace the process of language learning in early childhood education? (2) What, concretely, does the concept of language address in the expressive movement and in affective learning? (3) Why is it relevant to address the expressive movement of language learning, and why do so starting from the problem of writing? In this investigative process, the study asserts that language learning implies processes of subjectivation through which the idea of Alice's delirium, dream and sleepwalking brings to reading and writing the necessary relation with translation: a reading that, instead of reading the real, translates it with the intensive forces of the world, producing affections in the bodies involved (the reader, the writer, the text, the surroundings themselves) and making their potency vary; a reading that involves the translational artistry of a collective assemblage of enunciation through the tracing of lines of writing and of life; a writing as invention: the singular inscription of a self-world, a desiring tracing of creation. It therefore intends to argue that language learning happens as an expressive activity when a block of becoming-learner is composed through the creation of a style. To this end, it draws on conceptual tools produced by Gilles Deleuze (1925-1995) and Félix Guattari (1930-1992), Gilbert Simondon (1924-1989), Baruch Spinoza (1632-1677), Suely Rolnik (2006), Sandra Corazza (2013), Virginia Kastrup (1999) and Walter Kohan (2007), so as to try to intervene concretely in the discussions around the concepts of learning, language, time, signs, event, becoming, writing and style, proceeding through the concept of individuation and through the movements of intensive exploration of the milieus mapped in the educational encounters established at a Centro de Educação Infantil (early childhood education centre) in Vitória.
Abstract:
Forests have a crucial ecological role, and continuous forest loss can have colossal effects on the environment. As Armenia is one of the low forest cover countries in the world, this problem is even more critical. Continuous forest disturbances, mainly caused by illegal logging starting in the early 1990s, have done huge damage to the forest ecosystem by decreasing forest productivity and making more areas vulnerable to erosion. Another problem for Armenian forests is the lack of continuous monitoring and the absence of accurate estimates of the level of cutting in some years. In order to gain insight into the forest and its disturbances over a long period of time, we used Landsat TM/ETM+ images. The Google Earth Engine JavaScript API was used, an online tool enabling access to and analysis of a great amount of satellite imagery. To overcome the data availability problems caused by the gap in the Landsat series in 1988-1998, extensive cloud cover in the study area and missing scan lines, we used pixel-based compositing over the leaf-on vegetation temporal window (June to late September). Subsequently, pixel-based linear regression analyses were performed. Vegetation indices derived from the 10 biannual composites for the years 1984-2014 were used for trend analysis. In order to derive disturbances only within forests, a forest cover layer was aggregated and the original composites were masked. It was found that around 23% of the forests were disturbed during the study period.
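The study's processing was done with the Google Earth Engine JavaScript API; the sketch below only illustrates the underlying per-pixel linear-regression idea in NumPy on a stack of vegetation-index composites (a strongly negative slope flags a candidate disturbance) and is not the actual Earth Engine script. The composite years, index values and threshold are placeholders.

```python
import numpy as np

def per_pixel_trend(stack, years):
    """Least-squares slope of a vegetation index through time, per pixel.

    stack: array of shape (n_composites, rows, cols), e.g. NDVI composites
    years: array of shape (n_composites,) with the composite years
    Returns an array (rows, cols) of slopes; strongly negative values
    indicate a decline consistent with forest disturbance."""
    t = years - years.mean()
    y = stack - stack.mean(axis=0)
    return np.tensordot(t, y, axes=1) / np.sum(t**2)

# Example with 10 composites spanning 1984-2014, as in the study design.
years = np.linspace(1984, 2014, 10)
stack = np.random.default_rng(0).random((10, 50, 50))   # placeholder index stack
slope = per_pixel_trend(stack, years)
disturbed = slope < -0.01     # illustrative threshold, not the study's value
```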
Abstract:
Master's teaching practicum report in Ensino do 1.º e 2.º Ciclo do Ensino Básico (Teaching in the 1st and 2nd Cycles of Basic Education).
Abstract:
In this paper we show that the inclusion of unemployment-tenure interaction variates in Mincer wage equations is subject to serious pitfalls. These variates were designed to test whether or not the sensitivity of a worker’s wage to the business cycle varies with her tenure. We show that three canonical variates used in the literature, the minimum unemployment rate during a worker’s time at the firm (min u), the unemployment rate at the start of her tenure (Su) and the current unemployment rate interacted with a new-hire dummy (δu), can all be significant and "correctly" signed even when each worker in the firm receives the same wage regardless of tenure (equal treatment). In matched data the problem can be resolved by including firm-year interaction dummies in the panel. In unmatched data, where this is not possible, we propose a solution for min u and Su based on Solon, Barsky and Parker’s (1994) two-step method. This method is sub-optimal because it ignores a large amount of cross-tenure variation in average wages and is only valid when the scaled covariances of firm wages and firm employment are acyclical. Unfortunately δu cannot be identified in unmatched data, because a differential wage response to unemployment of new hires and incumbents will appear under both equal treatment and unequal treatment.
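To make the setting concrete, an illustrative Mincer-type specification (not the authors' exact equation) showing where the three variates enter:

```latex
% Illustrative log-wage equation for worker i at firm j in year t, with the
% three unemployment-tenure interaction variates discussed in the abstract.
\ln w_{ijt} = x_{it}'\beta
  + \gamma_1\, u^{\min}_{ijt}
  + \gamma_2\, u^{S}_{ij}
  + \gamma_3\, \delta_{ijt}\, u_t
  + \varepsilon_{ijt},
```

where x_it collects the usual Mincer controls, u^min is the minimum unemployment rate during the worker's time at the firm, u^S the unemployment rate at the start of her tenure, and δ a new-hire dummy interacted with the current unemployment rate u_t; the fix proposed for matched data amounts to adding firm-year interaction dummies to this specification.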
Abstract:
Analyses of mitochondrial DNA (mtDNA) control region polymorphism and of variation at 10 nuclear microsatellite loci were used to investigate the mechanisms and genetic consequences of postglacial expansion of Myotis myotis in Europe. Initial sampling consisted of 480 bats genotyped in 24 nursery colonies arranged along a transect of approximately 3000 km. The phylogeographical survey based on mtDNA sequences revealed the existence of major genetic subdivisions across this area, with several suture zones between haplogroups. Such zones of secondary contact were found in the Alps and Rhodopes, whereas other potential barriers to gene flow, like the Pyrenees, did not coincide with genetic discontinuities. Areas of population admixture increased locally the genetic diversity of colonies, which confounded the northward decrease in nucleotide diversity predicted using classical models of postglacial range expansion. However, when analyses were restricted to a subset of 15 nurseries originating from a single presumed glacial refugium, mtDNA polymorphism did indeed support a northwards decrease in diversity. Populations were also highly structured (PhiST = 0.384). Conversely, the same subset of colonies showed no significant latitudinal decrease in microsatellite diversity and much less population structure (FST = 0.010), but pairwise genetic differentiation at these nuclear markers was strongly correlated with increasing geographical distance. Together, this evidence suggests that alleles carried via male bats have maintained enough nuclear gene flow to counteract the effects of recurrent bottlenecks generally associated with recolonization processes. As females are highly philopatric, we argue that the maternally transmitted mtDNA marker better reflects the situation of past, historical gene flow, whereas current levels of gene flow are better reflected by microsatellite markers.