Pull requests: NVIDIA-NeMo/Automodel

fix(test): initialize weights in mimo_v2_flash round-trip fixture community-request
#2261 opened May 18, 2026 by khazic Contributor
1 of 2 tasks
fix(eagle3): reject non-positive draft_vocab_size / target_vocab_size community-request
#2260 opened May 18, 2026 by khazic Contributor
3 tasks done
fix(eagle3): validate ttt_steps >= 1 instead of returning NaN loss community-request
#2259 opened May 18, 2026 by khazic Contributor
2 tasks done
fix(eagle3): avoid UnboundLocalError on empty train dataloader community-request
#2258 opened May 18, 2026 by khazic Contributor
2 tasks done
fix(eagle3): flush trailing partial grad-accum window each epoch community-request
#2257 opened May 18, 2026 by khazic Contributor
2 tasks done
fix(eagle3): drop dead cur_loss_mask, raise on too-shallow aux recipe community-request waiting-on-customer Waiting on the original author to respond
#2256 opened May 18, 2026 by khazic Contributor
3 tasks done
feat(model): add Ling 2.0 / BailingMoeV2 (mini, flash, 1T) (#2242) community-request waiting-on-customer Waiting on the original author to respond
#2255 opened May 17, 2026 by Hayden727
3 tasks done
ci: skip uv lock generation on forks
#2252 opened May 15, 2026 by chtruong814 Contributor
3 tasks done
perf(diffusion): improve Flux training throughput
#2251 opened May 15, 2026 by pthombre Contributor
3 tasks done
docs(distributed): add mixed-precision training guide
#2248 opened May 15, 2026 by yuhezhang-ai Contributor Draft
2 of 7 tasks
fix(checkpoint): exclude TE _extra_state keys from load-time mismatch warning
#2247 opened May 15, 2026 by adil-a Collaborator
2 tasks done
[WIP] Add gemma4 drafter model support
#2240 opened May 15, 2026 by athitten Contributor Draft
3 tasks
feat: add use_memory_efficient_lora knob
#2239 opened May 15, 2026 by akoumpa Contributor Draft
3 tasks
feat(datasets): add S3/MSC object-storage support for MegatronPretrai… community-request waiting-on-maintainers Waiting on maintainers to respond
#2234 opened May 14, 2026 by hawkoli1987 Contributor
3 tasks done
feat(loggers): MLflow run resumption, accurate run status, and VLM/MoE coverage community-request waiting-on-customer Waiting on the original author to respond
#2231 opened May 14, 2026 by rob-luke Contributor
3 tasks done
feat: add support for data_dir_list in [num_samples, path] form enhancement New feature or request
#2229 opened May 13, 2026 by rnyak Collaborator
2 of 3 tasks
feat: Abstract LLM/VLM forward-backward step
#2228 opened May 13, 2026 by HuiyingLi Contributor Draft
ci: Update transformers to latest version 5.8.1
#2223 opened May 13, 2026 by svcnvidia-nemo-ci Contributor
feat(dllm): add DFlash and LLaDA2 SFT recipes community-request waiting-on-customer Waiting on the original author to respond
#2214 opened May 12, 2026 by kashif
3 tasks
ci: remove build-docs workflow
#2206 opened May 11, 2026 by ko3n1g Contributor