Abstract
We study the problem of generating control laws for systems with unknown dynamics. Our approach is to represent the controller and the value function with neural networks, and to train them using loss functions adapted from the Hamilton-Jacobi-Bellman (HJB) equations. In the absence of a known dynamics model, our method first learns the state transitions from data collected by interacting with the system in an offline process. The learned transition function is then integrated into the HJB equations and used to forward-simulate the control signals produced by our controller in a feedback loop. In contrast to trajectory optimization methods that optimize the controller for a single initial state, our controller can generate near-optimal control signals for initial states from a large portion of the state space. Compared to recent model-based reinforcement learning algorithms, we show that our method is more sample efficient and trains an order of magnitude faster. We demonstrate our method on a number of tasks, including the control of a quadrotor with 12 state variables.
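To make the core idea concrete, here is a minimal sketch (not the paper's implementation) of training a value function by minimizing the squared HJB residual. It uses a 1-D linear system with quadratic cost, a single-parameter value function V(x) = w·x², and the control induced by V; the dynamics coefficients `a`, `b` and cost weights `q`, `r` are illustrative assumptions.

```python
# Hedged illustration: fit V(x) = w*x^2 on a 1-D LQR problem by minimizing
# the squared HJB residual  min_u [ q*x^2 + r*u^2 + V'(x)*(a*x + b*u) ]
# over a batch of sampled states.
import numpy as np

a, b = 0.0, 1.0   # assumed linear dynamics  x_dot = a*x + b*u
q, r = 1.0, 1.0   # quadratic running cost   q*x^2 + r*u^2

xs = np.linspace(-2.0, 2.0, 101)   # training states sampled from the state space

def hjb_loss(w):
    """Mean squared HJB residual over the sampled states."""
    dV = 2.0 * w * xs              # V'(x) for V(x) = w*x^2
    u = -b * dV / (2.0 * r)        # control minimizing the HJB bracket given V
    residual = q * xs**2 + r * u**2 + dV * (a * xs + b * u)
    return np.mean(residual**2)

# Gradient descent on the single parameter w (finite-difference gradient).
w, lr, eps = 0.5, 0.01, 1e-6
for _ in range(500):
    g = (hjb_loss(w + eps) - hjb_loss(w - eps)) / (2.0 * eps)
    w -= lr * g

# For a=0, b=q=r=1 the algebraic Riccati equation gives V(x) = x^2, i.e. w = 1.
print(w)
```

In the paper's setting, `V` and the controller are neural networks and the dynamics term `a*x + b*u` is replaced by a transition model learned from offline interaction data; the structure of the residual loss is the same.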
Original language | English (US) |
---|---|
Title of host publication | Proceedings - ICRA 2023 |
Subtitle of host publication | IEEE International Conference on Robotics and Automation |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 953-960 |
Number of pages | 8 |
ISBN (Electronic) | 9798350323658 |
DOIs | |
State | Published - 2023 |
Externally published | Yes |
Event | 2023 IEEE International Conference on Robotics and Automation, ICRA 2023 - London, United Kingdom |
Duration | May 29, 2023 → Jun 2, 2023 |
Publication series
Name | Proceedings - IEEE International Conference on Robotics and Automation |
---|---|
Volume | 2023-May |
ISSN (Print) | 1050-4729 |
Conference
Conference | 2023 IEEE International Conference on Robotics and Automation, ICRA 2023 |
---|---|
Country/Territory | United Kingdom |
City | London |
Period | 5/29/23 → 6/2/23 |
Bibliographical note
Publisher Copyright: © 2023 IEEE.