reposiTUm: Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari

Record link:

http://hdl.handle.net/20.500.12708/176002
https://doi.org/10.34726/3909

Title:

Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari

Citation:

Schmidt, D., & Schmied, T. (2021, December 13). Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari [Poster Presentation]. Deep RL Workshop NeurIPS 2021, Online, Unknown. https://doi.org/10.34726/3909

reposiTUm DOI:

10.34726/3909

Publication Type:

Presentation - Poster Presentation

Language:

English

Authors:

Schmidt, Dominik
Schmied, Thomas

Organisational Unit:

E194-06 - Forschungsbereich Machine Learning

Date (published):

13-Dec-2021

Event name:

Deep RL Workshop NeurIPS 2021

Event date:

13-Dec-2021

Event place:

Online, Unknown

Keywords:

Machine Learning

Abstract:

Across the Arcade Learning Environment, Rainbow achieves a level of performance competitive with humans and modern RL algorithms. However, attaining this level of performance requires large amounts of data and hardware resources, making research in this area computationally expensive and use in practical applications often infeasible. This paper's contribution is threefold: We (1) propose an improved version of Rainbow, seeking to drastically reduce Rainbow's data, training time, and compute requirements while maintaining its competitive performance; (2) we empirically demonstrate the effectiveness of our approach through experiments on the Arcade Learning Environment, and (3) we conduct a number of ablation studies to investigate the effect of the individual proposed modifications. Our improved version of Rainbow reaches a median human normalized score close to classic Rainbow's, while using 20 times less data and requiring only 7.5 hours of training time on a single GPU. We also provide our full implementation including pre-trained models.

Link (external):

https://sites.google.com/view/deep-rl-workshop-neurips2021/home
https://openreview.net/forum?id=GvM7A3cv63M
https://github.com/schmidtdominik/Rainbow

Research Areas:

Information Systems Engineering: 100%

Science Branch:

1020 - Informatik: 100%

License:

CC BY 4.0