Skip to content

Pull requests: THUDM/slime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Support Qwen3.5 MoE INT4-QAT
#2156 opened Jun 30, 2026 by ShuZihan Loading…
4 of 9 tasks
[docker] Upgrade to sglang v0.5.14 run-ci-image
#2149 opened Jun 29, 2026 by zhuzilin Contributor Loading…
feat(p2p): add shard-level weight update with automatic broadcast fallback
#2146 opened Jun 29, 2026 by CalvinXKY Contributor Loading…
3 of 5 tasks
docs: fix dead examples/README link to low_precision
#2142 opened Jun 29, 2026 by aoshen02 Contributor Loading…
fix(examples): preserve geo3k response budget
#2140 opened Jun 27, 2026 by zhangdw156 Loading…
fix(examples): correct geo3k VLM default env
#2139 opened Jun 27, 2026 by zhangdw156 Loading…
docs(readme): add Dressage to Chinese ecosystem
#2138 opened Jun 27, 2026 by zhangdw156 Loading…
docs(examples): fix broken markdown links in rollout_buffer and examples
#2137 opened Jun 27, 2026 by CalvinXKY Contributor Loading…
docs(examples): list coding_agent_rl in examples/README
#2133 opened Jun 26, 2026 by aoshen02 Contributor Loading…
Skip entropy gradient computation when entropy_coef == 0
#2130 opened Jun 25, 2026 by CSUN1997 Loading…
Support partial rollout resume in Search-R1 example
#2128 opened Jun 23, 2026 by OLIVER-XYP Loading…
Reduce entropy logging memory when entropy coef is zero
#2127 opened Jun 23, 2026 by none0663 Contributor Loading…
Add test for megatron server run-ci-changed
#2123 opened Jun 23, 2026 by zhuzilin Contributor Loading…
fix(partial-rollout): cap max_new_tokens by prior response length
#2122 opened Jun 23, 2026 by none0663 Contributor Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.