-
Notifications
You must be signed in to change notification settings - Fork 586
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
support offloading weights & kv_cache for turbomind
enhancement
New feature or request
#3798
opened Jul 29, 2025 by
irexyc
Loading…
build(docker): Try to optimize docker
improvement
#3779
opened Jul 26, 2025 by
windreamer
Loading…
4 tasks done
Optimize create_model_inputs and schedule_decoding
improvement
#3766
opened Jul 24, 2025 by
grimoire
Loading…
fix: qwen3 nonstream parse with no or uncompleted think content
#3748
opened Jul 18, 2025 by
ywx217
Loading…
Update turbomind communication library
enhancement
New feature or request
#3736
opened Jul 16, 2025 by
lzhangzz
Loading…
[ascend] support lora
enhancement
New feature or request
#3715
opened Jul 7, 2025 by
tangzhiyi11
•
Draft
support sleep/wakeup for pt engine
enhancement
New feature or request
#3687
opened Jun 30, 2025 by
irexyc
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.