-
Notifications
You must be signed in to change notification settings - Fork 154
Pull requests: NVIDIA-NeMo/Automodel
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(test): initialize weights in mimo_v2_flash round-trip fixture
community-request
#2261
opened May 18, 2026 by
khazic
Contributor
Loading…
1 of 2 tasks
fix(eagle3): reject non-positive draft_vocab_size / target_vocab_size
community-request
#2260
opened May 18, 2026 by
khazic
Contributor
Loading…
3 tasks done
fix(eagle3): validate ttt_steps >= 1 instead of returning NaN loss
community-request
#2259
opened May 18, 2026 by
khazic
Contributor
Loading…
2 tasks done
fix(eagle3): avoid UnboundLocalError on empty train dataloader
community-request
#2258
opened May 18, 2026 by
khazic
Contributor
Loading…
2 tasks done
fix(eagle3): flush trailing partial grad-accum window each epoch
community-request
#2257
opened May 18, 2026 by
khazic
Contributor
Loading…
2 tasks done
fix(eagle3): drop dead cur_loss_mask, raise on too-shallow aux recipe
community-request
waiting-on-customer
Waiting on the original author to respond
#2256
opened May 18, 2026 by
khazic
Contributor
Loading…
3 tasks done
feat(model): add Ling 2.0 / BailingMoeV2 (mini, flash, 1T) (#2242)
community-request
waiting-on-customer
Waiting on the original author to respond
#2255
opened May 17, 2026 by
Hayden727
Loading…
3 tasks done
ci: skip uv lock generation on forks
#2252
opened May 15, 2026 by
chtruong814
Contributor
Loading…
3 tasks done
perf(diffusion): improve Flux training throughput
#2251
opened May 15, 2026 by
pthombre
Contributor
Loading…
3 tasks done
docs(distributed): add mixed-precision training guide
#2248
opened May 15, 2026 by
yuhezhang-ai
Contributor
•
Draft
2 of 7 tasks
fix(checkpoint): exclude TE _extra_state keys from load-time mismatch warning
#2247
opened May 15, 2026 by
adil-a
Collaborator
Loading…
2 tasks done
feat(datasets): add S3/MSC object-storage support for MegatronPretrai…
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2234
opened May 14, 2026 by
hawkoli1987
Contributor
Loading…
3 tasks done
feat(loggers): MLflow run resumption, accurate run status, and VLM/MoE coverage
community-request
waiting-on-customer
Waiting on the original author to respond
#2231
opened May 14, 2026 by
rob-luke
Contributor
Loading…
3 tasks done
feat: add support for data_dir_list in [num_samples, path] form
enhancement
New feature or request
#2229
opened May 13, 2026 by
rnyak
Collaborator
Loading…
2 of 3 tasks
ci: Update transformers to latest version 5.8.1
#2223
opened May 13, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
feat(dllm): add DFlash and LLaDA2 SFT recipes
community-request
waiting-on-customer
Waiting on the original author to respond
#2214
opened May 12, 2026 by
kashif
Loading…
3 tasks
fix: call init_weights() instead of initialize_weights() to restore w…
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2213
opened May 12, 2026 by
Meiyim
Loading…
docs(fern): scaffold Fern docs site mirroring published v0.4.0 sidebar
#2196
opened May 8, 2026 by
lbliii
Loading…
7 tasks
feat(deepseek-v4): add Multi-Token Prediction (MTP) training support
community-request
#2191
opened May 8, 2026 by
khazic
Contributor
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.