Smoothing Solution for Discrete-Time Nonlinear Stochastic Optimal Control Problem with Model-Reality Differences

Smoothing Solution for Discrete-Time Nonlinear Stochastic Optimal Control Problem with Model-Reality Differences
Author: Sie Long Kek
Publisher:
Total Pages:
Release: 2016
Genre: Mathematics
ISBN:

Download Smoothing Solution for Discrete-Time Nonlinear Stochastic Optimal Control Problem with Model-Reality Differences Book in PDF, Epub and Kindle

In this chapter, the performance of the integrated optimal control and parameter estimation (IOCPE) algorithm is improved using a modified fixed-interval smoothing scheme in order to solve the discrete-time nonlinear stochastic optimal control problem. In our approach, a linear model-based optimal control problem with adding the adjustable parameters into the model used is solved iteratively. The aim is to obtain the optimal solution of the original optimal control problem. In the presence of the random noise sequences in process plant and measurement channel, the state dynamics, which is estimated using Kalman filtering theory, is smoothed in a fixed interval. With such smoothed state estimate sequence that reduces the output residual, the feedback optimal control law is then designed. During the computation procedure, the optimal solution of the modified model-based optimal control problem can be updated at each iteration step. When convergence is achieved, the iterative solution approaches to the correct optimal solution of the original optimal control problem, in spite of model-reality differences. Moreover, the convergence of the resulting algorithm is also given. For illustration, optimal control of a continuous stirred-tank reactor problem is studied and the result obtained shows the efficiency of the approach proposed.

Nonlinear Systems

Nonlinear Systems
Author: Dongbin Lee
Publisher: BoD – Books on Demand
Total Pages: 366
Release: 2016-10-19
Genre: Mathematics
ISBN: 9535127144

Download Nonlinear Systems Book in PDF, Epub and Kindle

The book consists mainly of two parts: Chapter 1 - Chapter 7 and Chapter 8 - Chapter 14. Chapter 1 and Chapter 2 treat design techniques based on linearization of nonlinear systems. An analysis of nonlinear system over quantum mechanics is discussed in Chapter 3. Chapter 4 to Chapter 7 are estimation methods using Kalman filtering while solving nonlinear control systems using iterative approach. Optimal approaches are discussed in Chapter 8 with retarded control of nonlinear system in singular situation, and Chapter 9 extends optimal theory to H-infinity control for a nonlinear control system.Chapters 10 and 11 present the control of nonlinear dynamic systems, twin-rotor helicopter and 3D crane system, which are both underactuated, cascaded dynamic systems. Chapter 12 applies controls to antisynchronization/synchronization in the chaotic models based on Lyapunov exponent theorem, and Chapter 13 discusses developed stability analytic approaches in terms of Lyapunov stability. The analysis of economic activities, especially the relationship between stock return and economic growth, is presented in Chapter 14.

A Smoothing Framework for Stochastic Continuous-time Reinforcement Learning Problem

A Smoothing Framework for Stochastic Continuous-time Reinforcement Learning Problem
Author: Bowen Hu
Publisher:
Total Pages: 67
Release: 2021
Genre:
ISBN:

Download A Smoothing Framework for Stochastic Continuous-time Reinforcement Learning Problem Book in PDF, Epub and Kindle

Reinforcement learning problem embraces many breakthroughs in stochastic discrete-time and deterministic continuous-time systems. Stochastic continuous-time reinforcement learning is an important yet under studied area. In this dissertation, I present a framework to adapt deterministic continuous time temporal difference learning method to stochastic continuous time systems. I first review the temporal difference methods of discrete time and deterministic continuous time. Then I discuss a popular method that solves optimal control problem and verify its accuracy with Merton's problem. Motivated by the fact that the stochastic system and corresponding deterministic system can be as close as possible as the variance term decreases to zero, I introduce a new nonparametric smoothing method that generalizes deterministic continuous time method to stochastic problem by shrinking the variance term of the stochastic process. I demonstrate that the smoothing method outperforms traditional deterministic continuous time temporal difference method in our numerical study of the stochastic pendulum. In the end, I provide the proof of the convergence of the solution of the proposed framework to a corresponding deterministic continuous time solution. If the optimal value function and optimal policy can be obtained by traditional deterministic algorithms, then applying kernel smoothing framework with continuous TD guarantees convergence to the optimal value or policy for stochastic process.

On Sample Size Control in Sample Average Approximations for Solving Smooth Stochastic Programs

On Sample Size Control in Sample Average Approximations for Solving Smooth Stochastic Programs
Author:
Publisher:
Total Pages: 29
Release: 2009
Genre:
ISBN:

Download On Sample Size Control in Sample Average Approximations for Solving Smooth Stochastic Programs Book in PDF, Epub and Kindle

We consider smooth stochastic programs and develop a discrete-time optimal-control problem for adaptively selecting sample sizes in a class of algorithms based on sample average approximations (SAA). The control problem aims to minimize the expected computational cost to obtain a near-optimal solution of a stochastic program and is solved approximately using dynamic programming. The optimal-control problem depends on unknown parameters such as rate of convergence, computational cost per iteration, and sampling error. Hence, we implement the approach within a receding-horizon framework where parameters are estimated and the optimal- control problem is solved repeatedly during the calculations of a SAA algorithm. The resulting sample-size selection policy consistently produces near-optimal solutions in short computing times as compared to other plausible policies in several numerical examples.