Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix prefix caching Bug:P0
#4700 opened Jun 23, 2026 by grimoire Collaborator Loading…
fix _reduce_split_kernel for triton 3.5.1 Bug:P1
#4696 opened Jun 22, 2026 by irexyc Collaborator Loading…
Optimize TTFT
#4695 opened Jun 22, 2026 by grimoire Collaborator Draft
1 task
Remove interactive chat and make inference stateless
#4694 opened Jun 22, 2026 by lvhan028 Collaborator Draft
chore: remove deprecated model support
#4693 opened Jun 22, 2026 by CUHKSZzxy Collaborator Draft
bump version to v0.14.0
#4689 opened Jun 18, 2026 by lvhan028 Collaborator Loading…
Support long-context and MTP prefix-cache hits
#4688 opened Jun 17, 2026 by grimoire Collaborator Loading…
fix: gate multimodal preprocessing concurrency
#4687 opened Jun 17, 2026 by CUHKSZzxy Collaborator Loading…
[Improve]: Remove dlblas from lmdeploy
#4682 opened Jun 16, 2026 by RunningLeon Collaborator Loading…
fix: parse multimodal tool messages Bug:P1
#4680 opened Jun 16, 2026 by CUHKSZzxy Collaborator Loading…
Batch invariant support PART1
#4666 opened Jun 10, 2026 by grimoire Collaborator Draft
refactor: unify interleaved MRoPE rotary embedding
#4644 opened Jun 3, 2026 by CUHKSZzxy Collaborator Draft
Add multimodal and preemption metrics
#4640 opened Jun 1, 2026 by CUHKSZzxy Collaborator Loading…
TEST: Improve tool test
#4632 opened May 28, 2026 by littlegy Contributor Loading…
Interleave long-context prefill chunks with decode
#4631 opened May 28, 2026 by grimoire Collaborator Loading…
1 task done
modify save model in lite module improvement
#4624 opened May 26, 2026 by 43758726 Contributor Loading…
feat(turbomind): support priority schedule policy
#4614 opened May 22, 2026 by 4mengy Loading…
3 of 4 tasks
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605 opened May 21, 2026 by windreamer Collaborator Loading…
1 of 4 tasks
Intern s2 preview lite awq fix bug
#4600 opened May 19, 2026 by 43758726 Contributor Loading…
[WIP]: Support reuse routed experts on eviction
#4599 opened May 19, 2026 by RunningLeon Collaborator Loading…
ProTip! What’s not been updated in a month: updated:<2026-05-23.