Skip to content

Pull requests: lightseekorg/tokenspeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix gathered MXFP4 activation scales in Gluon MoE
#534 opened Jun 26, 2026 by qedawkins Contributor Loading…
test: glm-5.2 agentic bench
#532 opened Jun 26, 2026 by syuoni Member Draft
[WIP] Initial glm 5.2 support on amd
#528 opened Jun 26, 2026 by borontion Contributor Draft
[WIP] feat:support qwen3.5 dflash
#510 opened Jun 24, 2026 by minedec Contributor Draft
test(agentic): add EvalScope trie benchmark protocol
#466 opened Jun 17, 2026 by Xiangyi1996 Collaborator Draft
test(ci): add DeepSeek-V4-Flash MTP AIME25 eval
#461 opened Jun 16, 2026 by dongjiyingdjy Contributor Loading…
test: add dp4ep4 case in CI
#453 opened Jun 15, 2026 by tuanzhangCS Contributor Draft
[WIP] Refactor Cache Management
#447 opened Jun 15, 2026 by wangbo981016 Contributor Draft
Fix EP8 DP/TP RSAG init and empty LM head
#416 opened Jun 11, 2026 by yubofredwang Contributor Loading…
perf(gdn): fuse causal_conv1d and QKV split for GDN prefill
#382 opened Jun 8, 2026 by elwhyjay Contributor Loading…
Add Triton sampling backends alongside FlashInfer inactive
#280 opened May 27, 2026 by FlamingoPg Contributor Loading…
feat(trtllm-MHA): support mixed prefill/decode batches
#176 opened May 18, 2026 by rjzhb Collaborator Draft
4 tasks done
ProTip! Follow long discussions with comments:>50.