Keep segmentation compatible with typed mujoco-warp by tkelestemur · Pull Request #911 · mujocolab/mjlab

tkelestemur · 2026-04-15T20:47:46Z

This updates mjlab for the segmentation API change in google-deepmind/mujoco_warp#1283 while keeping mjlab's public camera API backward-compatible. mujoco_warp now exposes segmentation as (object_id, object_type) pairs (once the upstream PR is merged(, but mjlab still expects per-pixel geom IDs with -1 for background and -2 for flex hits. To preserve that contract, this adds a small Warp normalization step in the render/sense pipeline that converts the typed segmentation buffer back into mjlab's existing (B, H, W, 1) int32 output before camera data is exposed to sensors, viewers, or manipulation observations. This means downstream code such as camera_segmentation() and camera_target_cube_mask() continues to work unchanged.

This PR also bumps the pinned mujoco-warp revision to the current head commit for that upstream change and adds regression coverage for both the normalization logic and the downstream observation helpers that consume segmentation.

I validated the change with make check, make test, and make docs locally, and on. The main non-obvious tradeoff is that, because the upstream PR is not merged yet, this temporarily pins mjlab to the PR head commit rather than a merged upstream release.

kevinzakka

Thanks for taking this on @tkelestemur!

Reading the review thread on google-deepmind/mujoco_warp#1283, it sounds like the maintainers are okay with the breaking change. Given how new camera segmentation is in mjlab, could we just propagate the break instead of shimming the typed buffer back to the old (-1, -2, geom_id) layout?

WDYT?

tkelestemur · 2026-04-16T00:50:11Z

Thanks @kevinzakka

I agree. I updated the PR to propagate the upstream typed segmentation break instead of shimming it back to the legacy (-1, -2, geom_id) layout. CameraSensorData.segmentation now exposes (object_id, object_type) pairs with shape [B, H, W, 2], and I updated the downstream consumers that depended on the old format, including the target-cube mask logic and the Viser segmentation view.

I also bumped the mujoco_warp pin to the latest head of google-deepmind/mujoco_warp#1283.

Let me know what you think!

StafaH · 2026-04-16T13:40:31Z

@kevinzakka I think it makes more sense for the output to be 1 channel rather than 2 for training. I think the original conversion should just be updated so that the flex objects get a unique integer id after the regular geoms.

…mjwarp-segmentation-compat

Keep segmentation compatible with typed mujoco-warp

110142f

kevinzakka reviewed Apr 16, 2026

View reviewed changes

tkelestemur mentioned this pull request Apr 16, 2026

Semantic segmentation parity google-deepmind/mujoco_warp#1283

Open

Propagate typed segmentation output

6cdc335

tkelestemur added 2 commits April 21, 2026 12:38

Pin mujoco-warp to segmentation PR branch

2729e87

Merge branch 'main' of https://github.com/mujocolab/mjlab into tarik/…

55611df

…mjwarp-segmentation-compat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Keep segmentation compatible with typed mujoco-warp#911

Keep segmentation compatible with typed mujoco-warp#911
tkelestemur wants to merge 4 commits intomujocolab:mainfrom
tkelestemur:tarik/mjwarp-segmentation-compat

tkelestemur commented Apr 15, 2026

Uh oh!

kevinzakka left a comment

Uh oh!

tkelestemur commented Apr 16, 2026

Uh oh!

StafaH commented Apr 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

tkelestemur commented Apr 15, 2026

Uh oh!

kevinzakka left a comment

Choose a reason for hiding this comment

Uh oh!

tkelestemur commented Apr 16, 2026

Uh oh!

StafaH commented Apr 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants