#1727 CI caught an issue where the Qwen3 state dict adapter outputs unfused LoRA adapters (the transformers v4 behavior). In v5, Qwen3MoE is expected to save fused adapters for the expert layers. Since we are close to release, I limited the blast radius to Qwen3, but I strongly suspect other MoE custom models are affected as well; that remains to be investigated and fixed accordingly.
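For context, a minimal sketch of what "fusing" the expert LoRA weights means here: stacking the per-expert (v4-style, unfused) LoRA matrices into a single tensor per projection, matching a v5-style fused expert layout. The key names (`experts.{i}.gate_proj.lora_A.weight`, etc.) and the projection/matrix names are hypothetical placeholders, not the actual adapter's key schema.

```python
import torch

def fuse_expert_lora(state_dict, num_experts, prefix):
    """Stack per-expert (unfused) LoRA weights into one fused tensor per projection.

    Assumes hypothetical unfused keys of the form
    `{prefix}.experts.{i}.{proj}.{mat}.weight` and produces a fused key
    `{prefix}.experts.{proj}.{mat}.weight` of shape (num_experts, *per_expert_shape).
    """
    fused = dict(state_dict)
    for proj in ("gate_proj", "up_proj", "down_proj"):
        for mat in ("lora_A", "lora_B"):
            keys = [f"{prefix}.experts.{i}.{proj}.{mat}.weight" for i in range(num_experts)]
            if all(k in fused for k in keys):
                # Remove the per-expert entries and emit one stacked tensor.
                fused[f"{prefix}.experts.{proj}.{mat}.weight"] = torch.stack(
                    [fused.pop(k) for k in keys], dim=0
                )
    return fused
```

The inverse direction (splitting a fused tensor back into per-expert keys) is what the v4-style output corresponds to; the bug is that the adapter currently emits that unfused form where v5 expects the fused one.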