Download Adaptive Dynamic Programming for Control: Algorithms and by Huaguang Zhang, Derong Liu, Yanhong Luo, Ding Wang PDF

By Huaguang Zhang, Derong Liu, Yanhong Luo, Ding Wang

There are many equipment of reliable controller layout for nonlinear platforms. In looking to transcend the minimal requirement of balance, Adaptive Dynamic Programming in Discrete Time ways the difficult subject of optimum keep watch over for nonlinear structures utilizing the instruments of adaptive dynamic programming (ADP). the diversity of platforms taken care of is large; affine, switched, singularly perturbed and time-delay nonlinear platforms are mentioned as are the makes use of of neural networks and strategies of worth and coverage generation. The textual content positive aspects 3 major elements of ADP within which the tools proposed for stabilization and for monitoring and video games enjoy the incorporation of optimum keep watch over equipment:
• infinite-horizon keep an eye on for which the trouble of fixing partial differential Hamilton–Jacobi–Bellman equations without delay is conquer, and facts only if the iterative worth functionality updating series converges to the infimum of all of the price features acquired through admissible keep watch over legislations sequences;
• finite-horizon regulate, applied in discrete-time nonlinear platforms displaying the reader the way to receive suboptimal keep an eye on ideas inside of a hard and fast variety of keep watch over steps and with effects extra simply utilized in genuine platforms than these frequently received from infinite-horizon keep watch over;
• nonlinear video games for which a couple of combined optimum guidelines are derived for fixing video games either whilst the saddle element doesn't exist, and, while it does, fending off the life stipulations of the saddle element.
Non-zero-sum video games are studied within the context of a unmarried community scheme during which guidelines are bought making certain procedure balance and minimizing the person functionality functionality yielding a Nash equilibrium.
In order to make the assurance appropriate for the coed in addition to for the professional reader, Adaptive Dynamic Programming in Discrete Time:
• establishes the basic concept concerned basically with each one bankruptcy dedicated to a essentially identifiable keep an eye on paradigm;
• demonstrates convergence proofs of the ADP algorithms to deepen figuring out of the derivation of balance and convergence with the iterative computational tools used; and
• indicates how ADP tools could be placed to take advantage of either in simulation and in genuine functions.
This textual content may be of substantial curiosity to researchers drawn to optimum regulate and its functions in operations learn, utilized arithmetic computational intelligence and engineering. Graduate scholars operating up to speed and operations learn also will locate the information provided the following to be a resource of strong tools for furthering their study.

Show description

Read Online or Download Adaptive Dynamic Programming for Control: Algorithms and Stability PDF

Best system theory books

Algebraic Methods for Nonlinear Control Systems (Communications and Control Engineering)

This can be a self-contained creation to algebraic keep watch over for nonlinear platforms appropriate for researchers and graduate scholars. it's the first ebook facing the linear-algebraic method of nonlinear keep watch over platforms in this kind of distinctive and large model. It offers a complementary method of the extra conventional differential geometry and bargains extra simply with a number of very important features of nonlinear structures.

Systemantics: How Systems Work and Especially How They Fail

Systemantics: How platforms paintings and particularly How They Fail

Stock Market Modeling and Forecasting: A System Adaptation Approach

Inventory marketplace Modeling and Forecasting interprets adventure in procedure version received in an engineering context to the modeling of economic markets with a purpose to bettering the seize and knowing of marketplace dynamics. The modeling strategy is taken into account as picking out a dynamic procedure within which a true inventory industry is handled as an unknown plant and the identity version proposed is tuned through suggestions of the matching blunders.

Distributed Optimization-Based Control of Multi-Agent Networks in Complex Environments

This ebook bargains a concise and in-depth exposition of particular algorithmic recommendations for allotted optimization established regulate of multi-agent networks and their functionality research. It synthesizes and analyzes disbursed innovations for 3 collaborative initiatives: allotted cooperative optimization, cellular sensor deployment and multi-vehicle formation keep watch over.

Additional info for Adaptive Dynamic Programming for Control: Algorithms and Stability

Sample text

IEEE Trans Autom Sci Eng 2(2):121–131 109. Yen GG, Hickey TW (2004) Reinforcement learning algorithms for robotic navigation in dynamic environments. ISA Trans 43:217–230 110. Zadorojniy A, Shwartz A (2006) Robustness of policies in constrained Markov decision processes. IEEE Trans Autom Control 51(4):635–638 111. Zattoni E (2008) Structural invariant subspaces of singular Hamiltonian systems and nonrecursive solutions of finite-horizon optimal control problems. IEEE Trans Autom Control 53(5):1279–1284 112.

53). With the same state vector x(k) = (x (1) (k), x (2) (k), . . , x (p) (k)) and x(k + 1) = x (1) (k + 1), x (2) (k + 1), . . , x (p) (k + 1) , compute the resultant output target λi+1 (x(k)) = λi+1 (x (1) (k)), λi+1 (x (2) (k)), . . 50). 5. Set wc(i+1) = wci . With the data set (x (j ) (k), λi+1 (x (j ) (k))), j = 1, 2, . . 60) for jmax steps to get the approximate costate function λˆ i+1 . 6. With the data set (x (j ) (k), vi (x (j ) (k))), j = 1, 2, . . 64) for jmax steps to get the approximate control law vˆi .

Hua X, Mizukami K (1994) Linear-quadratic zero-sum differential games for generalized state space systems. IEEE Trans Autom Control 39(1):143–147 44. Hwnag KS, Chiou JY, Chen TY (2004) Reinforcement learning in zero-sum Markov games for robot soccer systems. In: Proceedings of the 2004 IEEE international conference on networking, sensing and control, Taipei, Taiwan, pp 1110–1114 45. Jamshidi M (1982) Large-scale systems-modeling and control. North-Holland, Amsterdam 46. Javaherian H, Liu D, Zhang Y, Kovalenko O (2004) Adaptive critic learning techniques for automotive engine control.

Download PDF sample

Rated 4.19 of 5 – based on 10 votes