Pull requests: THUDM/slime

Pull requests list

[docker] upgrade sglang to v0.5.13 run-ci-image
#2072 opened Jun 13, 2026 by zhuzilin Contributor
feat(rollouts) external rollouts endpoint with publish-only weight sync
#2071 opened Jun 12, 2026 by jvmncs
4 tasks done
[algo] Add CISPO advantage estimator (MiniMax-M1)
#2067 opened Jun 12, 2026 by EazyReal
[DON'T MERGE] run CI run-ci-megatron
#2053 opened Jun 11, 2026 by zhuzilin Contributor
support --num-workers for dataset parallel loading
#2048 opened Jun 10, 2026 by demouo Contributor
[docs] Fix OPD reverse KL formula in docs
#2039 opened Jun 9, 2026 by zihaocheng-buaa
Support OPD when teacher tokenization differs
#2032 opened Jun 8, 2026 by hhnqqq
examples: add CISPO custom loss
#2026 opened Jun 6, 2026 by kekmodel Contributor
fix(colocate): derive num_gpus_per_node from actor_num_gpus_per_node
#2012 opened Jun 3, 2026 by aoshen02 Contributor