104 resultados para Controlled stochastic differential equation, Infinite-dimensional stochastic differential equation, Quadratic optimal control


Relevância:

60.00% 60.00%

Publicador:

Resumo:

Differential evolution (DE) is arguably one of the most powerful stochastic real-parameter optimization algorithms of current interest. Since its inception in the mid 1990s, DE has been finding many successful applications in real-world optimization problems from diverse domains of science and engineering. This paper takes a first significant step toward the convergence analysis of a canonical DE (DE/rand/1/bin) algorithm. It first deduces a time-recursive relationship for the probability density function (PDF) of the trial solutions, taking into consideration the DE-type mutation, crossover, and selection mechanisms. Then, by applying the concepts of Lyapunov stability theorems, it shows that as time approaches infinity, the PDF of the trial solutions concentrates narrowly around the global optimum of the objective function, assuming the shape of a Dirac delta distribution. Asymptotic convergence behavior of the population PDF is established by constructing a Lyapunov functional based on the PDF and showing that it monotonically decreases with time. The analysis is applicable to a class of continuous and real-valued objective functions that possesses a unique global optimum (but may have multiple local optima). Theoretical results have been substantiated with relevant computer simulations.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We consider the problem of developing privacy-preserving machine learning algorithms in a dis-tributed multiparty setting. Here different parties own different parts of a data set, and the goal is to learn a classifier from the entire data set with-out any party revealing any information about the individual data points it owns. Pathak et al [7]recently proposed a solution to this problem in which each party learns a local classifier from its own data, and a third party then aggregates these classifiers in a privacy-preserving manner using a cryptographic scheme. The generaliza-tion performance of their algorithm is sensitive to the number of parties and the relative frac-tions of data owned by the different parties. In this paper, we describe a new differentially pri-vate algorithm for the multiparty setting that uses a stochastic gradient descent based procedure to directly optimize the overall multiparty ob-jective rather than combining classifiers learned from optimizing local objectives. The algorithm achieves a slightly weaker form of differential privacy than that of [7], but provides improved generalization guarantees that do not depend on the number of parties or the relative sizes of the individual data sets. Experimental results corrob-orate our theoretical findings.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Infinite horizon discounted-cost and ergodic-cost risk-sensitive zero-sum stochastic games for controlled Markov chains with countably many states are analyzed. Upper and lower values for these games are established. The existence of value and saddle-point equilibria in the class of Markov strategies is proved for the discounted-cost game. The existence of value and saddle-point equilibria in the class of stationary strategies is proved under the uniform ergodicity condition for the ergodic-cost game. The value of the ergodic-cost game happens to be the product of the inverse of the risk-sensitivity factor and the logarithm of the common Perron-Frobenius eigenvalue of the associated controlled nonlinear kernels. (C) 2013 Elsevier B.V. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We propose a novel form of nonlinear stochastic filtering based on an iterative evaluation of a Kalman-like gain matrix computed within a Monte Carlo scheme as suggested by the form of the parent equation of nonlinear filtering (Kushner-Stratonovich equation) and retains the simplicity of implementation of an ensemble Kalman filter (EnKF). The numerical results, presently obtained via EnKF-like simulations with or without a reduced-rank unscented transformation, clearly indicate remarkably superior filter convergence and accuracy vis-a-vis most available filtering schemes and eminent applicability of the methods to higher dimensional dynamic system identification problems of engineering interest. (C) 2013 The Franklin Institute. Published by Elsevier Ltd. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Infinite arrays of coupled two-state stochastic oscillators exhibit well-defined steady states. We study the fluctuations that occur when the number N of oscillators in the array is finite. We choose a particular form of global coupling that in the infinite array leads to a pitchfork bifurcation from a monostable to a bistable steady state, the latter with two equally probable stationary states. The control parameter for this bifurcation is the coupling strength. In finite arrays these states become metastable: The fluctuations lead to distributions around the most probable states, with one maximum in the monostable regime and two maxima in the bistable regime. In the latter regime, the fluctuations lead to transitions between the two peak regions of the distribution. Also, we find that the fluctuations break the symmetry in the bimodal regime, that is, one metastable state becomes more probable than the other, increasingly so with increasing array size. To arrive at these results, we start from microscopic dynamical evolution equations from which we derive a Langevin equation that exhibits an interesting multiplicative noise structure. We also present a master equation description of the dynamics. Both of these equations lead to the same Fokker-Planck equation, the master equation via a 1/N expansion and the Langevin equation via standard methods of Ito calculus for multiplicative noise. From the Fokker-Planck equation we obtain an effective potential that reflects the transition from the monomodal to the bimodal distribution as a function of a control parameter. We present a variety of numerical and analytic results that illustrate the strong effects of the fluctuations. We also show that the limits N -> infinity and t -> infinity(t is the time) do not commute. In fact, the two orders of implementation lead to drastically different results.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We propose a simulation-based algorithm for computing the optimal pricing policy for a product under uncertain demand dynamics. We consider a parameterized stochastic differential equation (SDE) model for the uncertain demand dynamics of the product over the planning horizon. In particular, we consider a dynamic model that is an extension of the Bass model. The performance of our algorithm is compared to that of a myopic pricing policy and is shown to give better results. Two significant advantages with our algorithm are as follows: (a) it does not require information on the system model parameters if the SDE system state is known via either a simulation device or real data, and (b) as it works efficiently even for high-dimensional parameters, it uses the efficient smoothed functional gradient estimator.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

A nonlinear stochastic filtering scheme based on a Gaussian sum representation of the filtering density and an annealing-type iterative update, which is additive and uses an artificial diffusion parameter, is proposed. The additive nature of the update relieves the problem of weight collapse often encountered with filters employing weighted particle based empirical approximation to the filtering density. The proposed Monte Carlo filter bank conforms in structure to the parent nonlinear filtering (Kushner-Stratonovich) equation and possesses excellent mixing properties enabling adequate exploration of the phase space of the state vector. The performance of the filter bank, presently assessed against a few carefully chosen numerical examples, provide ample evidence of its remarkable performance in terms of filter convergence and estimation accuracy vis-a-vis most other competing filters especially in higher dimensional dynamic system identification problems including cases that may demand estimating relatively minor variations in the parameter values from their reference states. (C) 2014 Elsevier Ltd. All rights reserved.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The problem of intercepting a maneuvering target at a prespecified impact angle is posed in nonlinear zero-sum differential games framework. A feedback form solution is proposed by extending state-dependent Riccati equation method to nonlinear zero-sum differential games. An analytic solution is obtained for the state-dependent Riccati equation corresponding to the impact-angle-constrained guidance problem. The impact-angle-constrained guidance law is derived using the states line-of-sight rate and projected terminal impact angle error. Local asymptotic stability conditions for the closed-loop system corresponding to these states are studied. Time-to-go estimation is not explicitly required to derive and implement the proposed guidance law. Performance of the proposed guidance law is validated using two-dimensional simulation of the relative nonlinear kinematics as well as a thrust-driven realistic interceptor model.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In this paper, we study two multi-dimensional Goodness-of-Fit tests for spectrum sensing in cognitive radios. The multi-dimensional scenario refers to multiple CR nodes, each with multiple antennas, that record multiple observations from multiple primary users for spectrum sensing. These tests, viz., the Interpoint Distance (ID) based test and the h, f distance based tests are constructed based on the properties of stochastic distances. The ID test is studied in detail for a single CR node case, and a possible extension to handle multiple nodes is discussed. On the other hand, the h, f test is applicable in a multi-node setup. A robustness feature of the KL distance based test is discussed, which has connections with Middleton's class A model. Through Monte-Carlo simulations, the proposed tests are shown to outperform the existing techniques such as the eigenvalue ratio based test, John's test, and the sphericity test, in several scenarios.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

A role for oestrogen in regulating fluid reabsorption in the monkey epididymis was recently demonstrated. Here, these Studies are extended to identify potential oestrogen-regulated proteins in the cauda region of monkey epididymis treated with vehicle and oestrogen receptor antagonist (ICI 182780). Two-dimensional electrophoretic analysis was used to identify the proteins. The results indicated down-regulation of WNT4 in the ICI-182780-treated monkey cauda. In addition. the Wnt4f mRNA concentration was also reduced in the caput regions of ICI-182780-treated rats and oestrogen receptor knockout mice. WNT4 is a key regulator of gonadal differentiation in humans and mice and plays a pivotal role in early mouse embryogenesis. The results of the present Study establish the presence of WNT4 in the monkey epididymis and its regulation by oestrogen, and Suggest a role for WNT4 in maintaining epididymal homeostasis.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

The unsteady turbulent incompressible boundary-layer flow over two-dimensional and axisymmetric bodies with pressure gradient has been studied. An eddy-viscosity model has been used to model the Reynolds shear stress. The unsteadiness is due to variations in the free stream velocity with time. The nonlinear partial differential equation with three independent variables governing the flow has been solved using Keller's Box method. The results indicate that the free stram velocity distribution exerts strong influence on the boundary-layer characteristics. The point of zero skin friction is found to move upstream as time increases.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

Transparent glasses in the composition BaO-0.5Li(2)O-4.5B(2)O(3) (BLBO) were fabricated via the conventional melt-quenching technique. X-ray powder diffraction combined with differential scanning calorimetric (DSC) studies carried out on the as-quenched samples confirmed their amorphous and glassy nature, respectively. The crystallization behavior of these glasses has been studied by isothermal and nonisothermal methods using DSC. Crystallization kinetic parameters were evaluated from the Johnson-Mehl-Avrami equation. The value of the Avrami exponent (n) was found to be 3.6 +/- 0.1, suggesting that the process involves three-dimensional bulk crystallization. The average value of activation energy associated with the crystallization of BLBO glasses was 317 +/- 10 kJ/mol. Transparent glass-ceramics were fabricated by controlled heat-treatment of the as-quenched glasses at 845 K/40 min. The dielectric constants for BLBO glasses and glass-ceramics in the 100 Hz-10 MHz frequency range were measured as a function of the temperature (300-925 K). The electrical relaxation and dc conductivity characteristics were rationalized using electric modulus formalism. The imaginary part of the electric modulus spectra was modeled using an approximate solution of the Kohlrausch-Williams-Watts relation. The temperature-dependent behavior of stretched exponent (beta) was discussed for the as-quenched and heat-treated BLBO glasses.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

Analytical solution of a 2-dimensional problem of solidification of a superheated liquid in a semi-infinite mould has been studied in this paper. On the boundary, the prescribed temperature is such that the solidification starts simultaneously at all points of the boundary. Results are also given for the 2-dimensional ablation problem. The solution of the heat conduction equation has been obtained in terms of multiple Laplace integrals involving suitable unknown fictitious initial temperatures. These fictitious initial temperatures have interesting physical interpretations. By choosing suitable series expansions for fictitious initial temperatures and moving interface boundary, the unknown quantities can be determined. Solidification thickness has been calculated for short time and effect of parameters on the solidification thickness has been shown with the help of graphs.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

The unsteady laminar compressible boundary-layer flow in the immediate vicinity of a two-dimensional stagnation point due to an incident stream whose velocity varies arbitrarily with time is considered. The governing partial differential equations, involving both time and the independent similarity variable, are transformed into new co-ordinates with finite ranges by means of a transformation which maps an infinite interval into a finite one. The resulting equations are solved by converting them into a matrix equation through the application of implicit finite-difference formulae. Computations have been carried out for two particular unsteady free-stream velocity distributions: (1) a constantly accelerating stream and (2) a fluctuating stream. The results show that in the former case both the skin-friction and the heat-transfer parameter increase steadily with time after a certain instant, while in the latter they oscillate thus responding to the fluctuations in the free-stream velocity.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

This paper describes an algorithm for ``direct numerical integration'' of the initial value Differential-Algebraic Inequalities (DAI) in a time stepping fashion using a sequential quadratic programming (SQP) method solver for detecting and satisfying active path constraints at each time step. The activation of a path constraint generally increases the condition number of the active discretized differential algebraic equation's (DAE) Jacobian and this difficulty is addressed by a regularization property of the alpha method. The algorithm is locally stable when index 1 and index 2 active path constraints and bounds are active. Subject to available regularization it is seen to be stable for active index 3 active path constraints in the numerical examples. For the high index active path constraints, the algorithm uses a user-selectable parameter to perturb the smaller singular values of the Jacobian with a view to reducing the condition number so that the simulation can proceed. The algorithm can be used as a relatively cheaper estimation tool for trajectory and control planning and in the context of model predictive control solutions. It can also be used to generate initial guess values of optimization variables used as input to inequality path constrained dynamic optimization problems. The method is illustrated with examples from space vehicle trajectory and robot path planning.