Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

This repository provides an implementation of Drama: a Mamba/Mamba2 powered model based reinforcement learning agent.

If you find Drama useful, please reference in your paper:

@inproceedings{
    wang2025drama,
    title={Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient},
    author={Wenlong Wang and Ivana dusparic and Yucheng Shi and Ke Zhang and Vinny Cahill},
    booktitle={The Thirteenth International Conference on Learning Representations},
    year={2025},
    url={https://openreview.net/forum?id=7XIkRgYjK3}
}

Training and Evaluating Instructions

Requirements

Python: 3.10
Operating System: Ubuntu 22.04 recommended (for Windows, use Docker)

Setup Instructions

Create and activate a Conda environment:

conda create --name drama python=3.10
conda activate drama

Note that because mamba-ssm in requirements.txt requires pytorch, so one should install pytorch before requirements.txt.

pip install torch==2.2.1
pip install -r reuqirements.txt

Docker Instructions

Build the Docker image

docker build -f Dockerfile -t drama .

Run the container with GPU support

docker run --gpus all -it --rm drama

Note: Running via Docker may result in slower performance. Please refer here, it is recommended to reproduce the result in ubuntu OS.

Training Instructions

Train with the default hyperparameters (the configuration files can be found in config_files/train.yaml)

python train.py

If one wants to change the hyperparmeter there are two ways:

Edit the configuration file config_files/train.yaml.
Run the train.py with parameters that corresponding to config_files/train.yaml. e.g.,python train.py --Models.WorldModel.Backbone Mamba.

Important parameters:

Drama supports three different dynamic models: Transformer, Mamba and Mamba-2. It supports two type of behaviour models: Actor-critic and PPO.

Code references

We've referenced several other projects during the development of this code:

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
config_files		config_files
csrc/selective_scan		csrc/selective_scan
envs		envs
mamba_ssm		mamba_ssm
sub_models		sub_models
.gitignore		.gitignore
=1.4.0		=1.4.0
Dockerfile		Dockerfile
agents.py		agents.py
atari_performance.csv		atari_performance.csv
env_wrapper.py		env_wrapper.py
eval.py		eval.py
play.py		play.py
readme.md		readme.md
replay_buffer.py		replay_buffer.py
requirements.txt		requirements.txt
tools.py		tools.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

Training and Evaluating Instructions

Requirements

Setup Instructions

Docker Instructions

Training Instructions

Important parameters:

Code references

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

Training and Evaluating Instructions

Requirements

Setup Instructions

Docker Instructions

Training Instructions

Important parameters:

Code references

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages