Back to Publications
Distributed Deep Reinforcement Learning for Fighting Forest Fires with a Network of Aerial Robots
Ravi N. Haksar, Mac Schwager
2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018
Abstract
This paper proposes a distributed deep reinforcement learning (RL) based strategy for a team of Unmanned Aerial Vehicles (UAVs) to autonomously fight forest fires. We first model the forest fire as a Markov decision process (MDP) with a factored structure. We consider optimally controlling the forest fire without agents using dynamic programming, and show any exact solution and many approximate solutions are computationally intractable. Given the problem complexity, we consider a deep RL approach in which each agent learns a policy requiring only local information. We show with Monte Carlo simulations that the deep RL policy outperforms a handtuned heuristic, and scales well for various forest sizes and different numbers of UAVs as well as variations in model parameters. Experimental demonstrations with mobile robots fighting a simulated forest fire in the Robotarium at the Georgia Institute of Technology are also presented.
BibTeX
@inproceedings{haksar_distributed_2018,
address = {Madrid, Spain},
title = {Distributed {Deep} {Reinforcement} {Learning} for {Fighting} {Forest} {Fires} with a {Network} of {Aerial} {Robots}},
isbn = {978-1-5386-8094-0},
url = {https://ieeexplore.ieee.org/document/8593539/},
abstract = {This paper proposes a distributed deep reinforcement learning (RL) based strategy for a team of Unmanned Aerial Vehicles (UAVs) to autonomously fight forest fires. We first model the forest fire as a Markov decision process (MDP) with a factored structure. We consider optimally controlling the forest fire without agents using dynamic programming, and show any exact solution and many approximate solutions are computationally intractable. Given the problem complexity, we consider a deep RL approach in which each agent learns a policy requiring only local information. We show with Monte Carlo simulations that the deep RL policy outperforms a handtuned heuristic, and scales well for various forest sizes and different numbers of UAVs as well as variations in model parameters. Experimental demonstrations with mobile robots fighting a simulated forest fire in the Robotarium at the Georgia Institute of Technology are also presented.},
language = {en},
urldate = {2020-09-15},
booktitle = {2018 {IEEE}/{RSJ} {International} {Conference} on {Intelligent} {Robots} and {Systems} ({IROS})},
publisher = {IEEE},
author = {Haksar, Ravi N. and Schwager, Mac},
month = oct,
year = {2018},
keywords = {reinforcement learning, multi-robot, aerial},
pages = {1067--1074},
month_numeric = {10}
}