Skip to content

SFT phase of TwinRL#3424

Draft
int-smart wants to merge 3 commits intohuggingface:mainfrom
int-smart:feat/add_twinrl
Draft

SFT phase of TwinRL#3424
int-smart wants to merge 3 commits intohuggingface:mainfrom
int-smart:feat/add_twinrl

Conversation

@int-smart
Copy link
Copy Markdown

Title

TwinRL implementation

Summary / Motivation

  • PR has SFT/offline phase based on TwinRL Jax code.

Related issues

  • Fixes / Closes: # (if any)
  • Related: # (if any)

What changed

  • Short, concrete bullets explaining the functional changes (how the behavior or output differs now).
  • Short note if this introduces breaking changes and migration steps.

How was this tested (or how to run locally)

  • Tests added: list new tests or test files. pytest -q tests/ -k <keyword>
  • Manual checks / dataset runs performed.
  • Instructions for the reviewer for reproducing with a quick example or CLI (if applicable)

Checklist (required before merge)

  • Linting/formatting run (pre-commit run -a)
  • All tests pass locally (pytest)
  • Documentation updated
  • CI is green
  • Community Review: I have reviewed another contributor's open PR and linked it here: # (insert PR number/link)

Reviewer notes

  • Anything the reviewer should focus on (performance, edge-cases, specific files) or general notes.
  • Anyone in the community is free to review the PR.

@github-actions github-actions Bot added the policies Items related to robot policies label Apr 21, 2026
@int-smart int-smart marked this pull request as draft April 21, 2026 06:41
@s1lent4gnt s1lent4gnt self-assigned this Apr 21, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

policies Items related to robot policies

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants