Explainable RL | Anurag Maurya

Overview

This project focuses on explainable reinforcement learning by developing object-centric representations and surrogate models to interpret the decision-making of Deep Q-Network (DQN) agents trained on ATARI games.

Key Contributions

DQN Training & Object-Centric Representations
- Trained DQN agents on ATARI environments, Pong,Boxing.
- Contributed to a neuro-symbolic RL framework for object-centric learning.
Surrogate Models for Explainability
- Distilled DQN behavior into decision trees and neural networks for exploring imitation implications.
- Leveraged object-centric data to enhance interpretability of RL agents.

Guided Exploration for RL Agents
- Currently working on region proposal methods to improve exploration strategies in ATARI games.
- This work focuses on code based meta-controller for hard exploration games like PitFall and Montezuma’s revenge

Acknowledgment

Conducted at the AIML Lab, TU Darmstadt, under supervision of Quentin Delfosse and Prof. Kristian Kersting as a HiWi(Funded research assistant position)