Explainable RL
solving atari games with neurosymbolic explainable plans
Overview
This project focuses on explainable reinforcement learning by developing object-centric representations and surrogate models to interpret the decision-making of Deep Q-Network (DQN) agents trained on ATARI games.
Key Contributions
- DQN Training & Object-Centric Representations
- Trained DQN agents on ATARI environments, Pong,Boxing.
- Contributed to a neuro-symbolic RL framework for object-centric learning.
- Surrogate Models for Explainability
- Distilled DQN behavior into decision trees and neural networks for exploring imitation implications.
- Leveraged object-centric data to enhance interpretability of RL agents.

- Guided Exploration for RL Agents
- Currently working on region proposal methods to improve exploration strategies in ATARI games.
- This work focuses on code based meta-controller for hard exploration games like PitFall and Montezuma’s revenge
Acknowledgment
Conducted at the AIML Lab, TU Darmstadt, under supervision of Quentin Delfosse and Prof. Kristian Kersting as a HiWi(Funded research assistant position)