Skip to content

Pull requests: NVIDIA-NeMo/Megatron-Bridge

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[docs] chore: bump versions1.json to 0.4.0 (latest)
#3376 opened Apr 17, 2026 by ko3n1g Contributor Loading…
[docs] chore: bump versions1.json to 0.4.0 (latest)
#3375 opened Apr 17, 2026 by ko3n1g Contributor Loading…
VR200 cfgs, Lm3 70b, 405b, qwen3 30b, 235b, gpt-oss, kimi
#3374 opened Apr 17, 2026 by malay-nagda Contributor Loading…
5 tasks
[ci] feat: use AWS ephemeral runners for external contributors
#3370 opened Apr 17, 2026 by ko3n1g Contributor Loading…
3 tasks
Optimize refit operations
#3369 opened Apr 17, 2026 by ananthsub Contributor Draft
5 tasks
b200 better cfg
#3368 opened Apr 17, 2026 by malay-nagda Contributor Draft
5 tasks
ci: add GB200 functional test infrastructure full-test-suite
#3365 opened Apr 17, 2026 by ko3n1g Contributor Draft
2 tasks
[training] feat: enable fp4_param_gather in MixedPrecisionConfig 26.04.01 area:perf Performance optimizations and benchmarking performance/release Performance items related with NeMo release performance r0.4.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#3364 opened Apr 16, 2026 by dingqingy-nv Contributor Loading…
2 of 3 tasks
26.04
chore(beep boop 🤖): Bump uv.lock (main, mcore-dev) (2026-04-16) area:build Dependencies, packaging, images, and environment setup ci CI, automation, test queue, or workflow infrastructure work full-test-suite needs-review PR is ready for code review and waiting on a reviewer
#3355 opened Apr 16, 2026 by svcnvidia-nemo-ci Contributor Loading…
chore(beep boop 🤖): Bump uv.lock (r0.4.0, mcore-core_r0.17.0) (2026-04-16) area:build Dependencies, packaging, images, and environment setup ci CI, automation, test queue, or workflow infrastructure work full-test-suite needs-review PR is ready for code review and waiting on a reviewer
#3353 opened Apr 16, 2026 by svcnvidia-nemo-ci Contributor Loading…
[data, docs] feat: Add fast dataloading configs & documentation area:data Dataset builders, preprocessing, and samplers feature New capabilities, enhancements, or enablement work needs-follow-up Issue needs follow-up
#3351 opened Apr 16, 2026 by asolergi-nv Loading…
5 tasks
[perf] fix: use direct assignment for NCCL env vars when nccl_ub enabled area:perf Performance optimizations and benchmarking bug Something isn't working r0.4.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge. ready-to-merge PR is approved, current, and only waiting for CI to pass before merge
#3350 opened Apr 16, 2026 by dingqingy-nv Contributor Loading…
2 tasks
fix(qwen3): add MTP weight mappings to Qwen3Bridge area:model Model implementations and HF bridge logic bug Something isn't working community-request ready-to-merge PR is approved, current, and only waiting for CI to pass before merge
#3349 opened Apr 16, 2026 by Doondi-Ashlesh Loading…
Add dense_grouped_gemm support in GPTModelProvider area:model Model implementations and HF bridge logic feature New capabilities, enhancements, or enablement work ready-to-merge PR is approved, current, and only waiting for CI to pass before merge
#3344 opened Apr 15, 2026 by sraman-rgb Loading…
4 of 5 tasks
cp: Update Qwen3-VL pretrain perf configs for 30B and 235B (3327) into r0.4.0 area:perf Performance optimizations and benchmarking cherry-pick feature New capabilities, enhancements, or enablement work Run CICD
#3342 opened Apr 15, 2026 by svcnvidia-nemo-ci Contributor Loading…
[ckpt, peft] fix: merge LoRA adapters in grouped HF export area:peft Parameter-efficient fine-tuning (LoRA, adapters) bug Something isn't working community-request
#3341 opened Apr 15, 2026 by HollowMan6 Contributor Loading…
2 of 5 tasks
rename models/mimo infra to models/omni_modal area:model Model implementations and HF bridge logic feature New capabilities, enhancements, or enablement work needs-review PR is ready for code review and waiting on a reviewer
#3340 opened Apr 15, 2026 by liding-nv Contributor Loading…
add async save support for fsdp checkpoints area:ckpt Checkpoint conversion, loading, export, and save paths feature New capabilities, enhancements, or enablement work
#3339 opened Apr 15, 2026 by dimapihtar Contributor Loading…
5 tasks
[model] fix: Qwen3.5-VL MTP standard attn specs patch area:model Model implementations and HF bridge logic bug Something isn't working community-request needs-more-tests Requires additional L0 and L1 test coverage before merge ready-to-merge PR is approved, current, and only waiting for CI to pass before merge
#3330 opened Apr 14, 2026 by HollowMan6 Contributor Loading…
2 of 5 tasks
[main] qwen-vl THD packed-sequence support and fixes area:model Model implementations and HF bridge logic community-request feature New capabilities, enhancements, or enablement work needs-review PR is ready for code review and waiting on a reviewer
#3323 opened Apr 14, 2026 by DAISY-gh Contributor Loading…
[model]Add Qwen3‑Omni training support area:model Model implementations and HF bridge logic community-request feature New capabilities, enhancements, or enablement work needs-review PR is ready for code review and waiting on a reviewer
#3317 opened Apr 14, 2026 by hbhflw2000 Loading…
3 of 5 tasks
ProTip! Type g i on any issue or pull request to go back to the issue listing page.