Skip to content

Pull requests: unslothai/unsloth

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[trl] Trl v0.28 (and above) rl fixes
#4156 opened Mar 4, 2026 by Datta0 Loading…
Moe kernels refactor
#4145 opened Mar 3, 2026 by Datta0 Loading…
Completion mask fix
#4140 opened Mar 2, 2026 by pluesclues Loading…
Add Idefics3 support (Granite Docling VLM)
#4090 opened Feb 22, 2026 by gaztrabisme Loading…
Fix TRL 0.25.1+ GRPO vision crash and reward function TypeError
#3975 opened Feb 3, 2026 by danielhanchen Loading…
5 tasks done
Asft plus
#3918 opened Jan 21, 2026 by hcsolakoglu Draft
introduce device_context to simplify code.
#3875 opened Jan 11, 2026 by ykaitao Loading…
feat: add mlx model and trainer
#3856 opened Jan 6, 2026 by JINO-ROHIT Loading…
Add context parallelism support (SDPA only)
#3823 opened Jan 2, 2026 by djsaunde Loading…
1 of 2 tasks
fix: propagate revision to vLLM fast_inference
#3816 opened Jan 2, 2026 by majiayu000 Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.