Safety verification of model based reinforcement learning controllers using reachability analysis

Gupta, Akshita

doi:10.25394/PGS.9118943.v1

MS_thesis_Akshita_Gupta_v3.pdf (1.11 MB)

Safety verification of model based reinforcement learning controllers using reachability analysis

thesis

posted on 2019-08-13, 19:38 authored by Akshita GuptaAkshita Gupta

Reinforcement Learning (RL) is a data-driven technique which is finding increasing application in the development of controllers for sequential decision making problems. Their wide adoption can be attributed to the fact that the development of these controllers is independent of the

knowledge of the system and thus can be used even when the environment dynamics are unknown. Model-Based RL controllers explicitly model the system dynamics from the observed (training) data using a function approximator, followed by using a path planning algorithm to obtain the optimal control sequence. While these controllers have been proven to be successful in simulations, lack of strong safety guarantees in the presence of noise makes them ill-posed for deployment on hardware, specially in safety critical systems. The proposed work aims at bridging this gap by providing a verification framework to evaluate the safety guarantees for a Model-Based RL controller. Our method builds upon reachability analysis to determine if there is any action which can drive the system into a constrained (unsafe) region. Consequently, our method can provide a binary yes or no answer to whether all the initial set of states are (un)safe to propagate trajectories from in the presence of some bounded noise.

History

Degree Type

Master of Science

Department

Aeronautics and Astronautics

Campus location

West Lafayette

Advisor/Supervisor/Committee Chair

Prof. Inseok Hwang

Additional Committee Member 2

Prof. Martin Corless

Additional Committee Member 3

Prof. Arthur Frazho

Additional Committee Member 4

Prof. Dengfeng Sun

Usage metrics

Keywords

Reachable set Reinforcement learning safety verification Aerospace Engineering Automation and Control Engineering

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Safety verification of model based reinforcement learning controllers using reachability analysis

History

Degree Type

Department

Campus location

Advisor/Supervisor/Committee Chair

Additional Committee Member 2

Additional Committee Member 3

Additional Committee Member 4

Usage metrics

Categories

Keywords

Licence

Exports