
upgrade torchao to 0.17.0 #3569

Merged

winglian merged 8 commits into main from torchao-0170 on Apr 2, 2026

Conversation

@winglian (Collaborator) commented Apr 1, 2026

Summary by CodeRabbit

Release Notes

  • Chores

    • Updated dependencies: torchao to 0.17.0 and mistral-common to 1.11.0
  • Improvements

    • Enhanced quantization configuration handling for improved compatibility with the latest library versions

@coderabbitai bot (Contributor) commented Apr 1, 2026

Important

Review skipped

Auto incremental reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: b9195b60-b161-48ec-a70b-8dd878f72ac9

📝 Walkthrough

This PR updates pinned dependencies in requirements.txt (torchao 0.16.0→0.17.0, mistral-common 1.10.0→1.11.0) and adapts the codebase to the new torchao API by updating import paths, replacing deprecated quantization config types, and reflecting changes in test assertions.

Changes

| Cohort / File(s) | Summary |
|---|---|
| **Dependency Updates**<br>`requirements.txt` | Bumped torchao from 0.16.0 to 0.17.0 and mistral-common from 1.10.0 to 1.11.0. |
| **Torchao API Updates**<br>`src/axolotl/core/builders/base.py` | Updated the AdamWFp8 import from `torchao.prototype.low_bit_optim` to `torchao.optim.adam` to reflect the new torchao module structure. |
| **Quantization Config Adapter**<br>`src/axolotl/utils/quantization.py` | Replaced the deprecated `Int8DynamicActivationInt4WeightConfig` with `Int8DynamicActivationIntxWeightConfig`; updated `NVFP4InferenceConfig` to `NVFP4WeightOnlyConfig` for torchao ≥2.8.0; refactored config construction to use kwargs with an explicit `weight_dtype` and a conditional `weight_granularity`. |
| **Test Updates**<br>`tests/e2e/test_quantization.py` | Updated PTQ test config imports and assertions to expect `Int8DynamicActivationIntxWeightConfig` instead of the deprecated int4-weight variant. |
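For orientation, here is a minimal sketch of what the new-style config construction could look like after this change. The helper name `build_int8da_int4w_config` and the exact branching are illustrative assumptions rather than the PR's actual code; only the torchao names (`Int8DynamicActivationIntxWeightConfig`, `PerGroup`) and the kwargs called out in the summary (`weight_dtype`, conditional `weight_granularity`) are taken from this PR.

```python
# Illustrative sketch only; not the PR's actual implementation.
# Assumes torchao >= 0.17.0, where the deprecated
# Int8DynamicActivationInt4WeightConfig is replaced by the more general
# Int8DynamicActivationIntxWeightConfig.
from typing import Optional

import torch
from torchao.quantization import Int8DynamicActivationIntxWeightConfig
from torchao.quantization.granularity import PerGroup


def build_int8da_int4w_config(group_size: Optional[int] = None):
    """Hypothetical helper: int8 dynamic activations with int4 weights."""
    kwargs = {"weight_dtype": torch.int4}
    if group_size is not None:
        # Only attach a granularity when a group size is requested,
        # mirroring the "conditional weight_granularity" described above.
        kwargs["weight_granularity"] = PerGroup(group_size)
    return Int8DynamicActivationIntxWeightConfig(**kwargs)
```

Such a config would then typically be applied with torchao's `quantize_(model, config)` entry point.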

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

Suggested reviewers

  • SalmanMohammadi
🚥 Pre-merge checks | ✅ 3
✅ Passed checks (3 passed)
| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit's high-level summary is enabled. |
| Title check | ✅ Passed | The title 'upgrade torchao to 0.17.0' directly and clearly describes the main change across all modified files, which centers on upgrading the torchao dependency and adapting code to work with the new version. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |

✏️ Tip: You can configure your own custom pre-merge checks in the settings.


@winglian winglian added the scheduled_release This PR is slated for the upcoming release label Apr 1, 2026
@coderabbitai bot (Contributor) left a comment

Actionable comments posted: 1

🧹 Nitpick comments (1)
src/axolotl/utils/quantization.py (1)

103-106: Consider adding test coverage for the weight_granularity parameter.

The new code path that sets weight_granularity=PerGroup(group_size=...) when group_size is provided (Lines 104-105) is not directly validated in the tests. Per tests/e2e/test_quantization.py:63-83, the ptq_config_test_cases entry for (int4, int8) uses group_size=None, and the test at Lines 117-128 only validates the returned type via isinstance().

While ptq_test_cases does include group_size=8 for this combination, it only validates the quantized tensor class, not the config's weight_granularity attribute. Consider adding a test case that verifies the PerGroup granularity is correctly set.
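A possible shape for such a test, sketched directly against the torchao config since the axolotl-side factory name is not shown in this thread; the real test would presumably route through the same code path exercised by `ptq_config_test_cases`:

```python
# Hypothetical test sketch; adapt it to call the actual axolotl config factory.
import torch
from torchao.quantization import Int8DynamicActivationIntxWeightConfig
from torchao.quantization.granularity import PerGroup


def test_group_size_sets_pergroup_weight_granularity():
    group_size = 8
    config = Int8DynamicActivationIntxWeightConfig(
        weight_dtype=torch.int4,
        weight_granularity=PerGroup(group_size),
    )
    # The interesting assertions are the last two: they check the granularity
    # itself rather than only the config type, as the comment above suggests.
    assert isinstance(config, Int8DynamicActivationIntxWeightConfig)
    assert isinstance(config.weight_granularity, PerGroup)
    assert config.weight_granularity.group_size == group_size
```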

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/axolotl/utils/quantization.py` around lines 103 - 106, Add a unit test
that covers the code path when group_size is provided by calling the same
factory that builds the Int8DynamicActivationIntxWeightConfig (the code that
currently sets kwargs = {"weight_dtype": torch.int4} and returns
Int8DynamicActivationIntxWeightConfig(**kwargs)), pass group_size=8 (or another
value), and assert that the returned config.weight_granularity is an instance of
PerGroup and that its group_size equals the value passed; this ensures the
PerGroup(weight_granularity=...) branch is exercised and validated.
ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 65fbea7f-3aad-4d12-887e-953d06c75a8f

📥 Commits

Reviewing files that changed from the base of the PR and between 96ae8bd and c9ffee1.

📒 Files selected for processing (4)
  • requirements.txt
  • src/axolotl/core/builders/base.py
  • src/axolotl/utils/quantization.py
  • tests/e2e/test_quantization.py

Comment on lines 331 to 335

```diff
         elif self.cfg.optimizer == "ao_adamw_fp8":
-            from torchao.prototype.low_bit_optim import AdamWFp8
+            from torchao.optim.adam import AdamWFp8

             optimizer_cls = AdamWFp8
             optimizer_kwargs.update(adam_kwargs)
```
⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

🌐 Web query:

What is the import path for AdamWFp8 in torchao 0.17.0?

💡 Result:

The import path for AdamWFp8 in torchao 0.17.0 is from torchao.optim import AdamWFp8.


Fix incorrect import path for AdamWFp8 in torchao 0.17.0.

The import path should be from torchao.optim import AdamWFp8, not from torchao.optim.adam import AdamWFp8. Update line 332 to use the correct module path.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/axolotl/core/builders/base.py` around lines 331 - 335, The import for
AdamWFp8 is using the wrong module path; inside the branch that checks
self.cfg.optimizer == "ao_adamw_fp8" replace the import statement so it imports
AdamWFp8 from torchao.optim (i.e., use "from torchao.optim import AdamWFp8"),
leaving the rest of the block (setting optimizer_cls = AdamWFp8 and updating
optimizer_kwargs with adam_kwargs) unchanged so optimizer_cls and
optimizer_kwargs continue to work as before.
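As a quick sanity check of the suggested path, the import below should resolve on torchao 0.17.0 without going through the old prototype module (a sketch assuming torchao 0.17.0 is installed):

```python
# Verifies the reviewer-suggested top-level import path.
from torchao.optim import AdamWFp8

print(AdamWFp8)  # resolves without importing torchao.prototype.low_bit_optim
```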

@codecov bot commented Apr 1, 2026

Codecov Report

❌ Patch coverage is 62.92135% with 33 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/axolotl/monkeypatch/torchao_optim.py | 60.00% | 30 Missing ⚠️ |
| src/axolotl/utils/quantization.py | 75.00% | 2 Missing ⚠️ |
| src/axolotl/core/builders/base.py | 0.00% | 1 Missing ⚠️ |


@winglian winglian mentioned this pull request Apr 1, 2026
Comment thread on src/axolotl/monkeypatch/torchao_optim.py (Outdated)

```python
import logging

import torch
from torch.utils._python_dispatch import return_and_correct_aliasing

logger = logging.getLogger(__name__)
```
A Collaborator commented:

Use the axolotl logger and not this.

@winglian (Collaborator, Author) commented Apr 2, 2026

The upstream fixes for the ao 8-bit and 4-bit optimizers are here: pytorch/ao#4216, but we've patched the ATen ops in axolotl for now.

@winglian winglian merged commit 573726c into main Apr 2, 2026
15 of 19 checks passed
@winglian winglian deleted the torchao-0170 branch April 2, 2026 14:18
