Movatterモバイル変換

InternLM/lmdeployPublic

NotificationsYou must be signed in to change notification settings
Fork655
Star7.6k

New pull requestNew

55 Open 2,021 Closed

Author

Label

Projects

Milestones

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Sort

fix: change debug log from ERROR to DEBUG in RepetitionPenaltyKernel

#4363 openedFeb 15, 2026 bymurray-macdonald

Loading…

GLM-4.7-Flash Turbomind support

#4362 openedFeb 13, 2026 bylapy

Loading…

2 of 4 tasks

fix fa3 install

#4361 openedFeb 13, 2026 byirexyc

Loading…

[WIP] Support video inputs

#4360 openedFeb 13, 2026 byCUHKSZzxy • Draft

fix ssm inputs merge

#4359 openedFeb 13, 2026 bygrimoire

Loading…

ci(lint): skip flaky deadlink test for python wiki page

#4357 openedFeb 13, 2026 bywindreamer

Loading…

support glm5

#4355 openedFeb 12, 2026 bygrimoire

Loading…

Improve proxy server improvement

#4354 openedFeb 12, 2026 bylvhan028

Loading…

Qwen3.5

#4351 openedFeb 11, 2026 bygrimoire

Loading…

Fix XGrammar bitmask initialization and add null check for gen_config in generate method

#4349 openedFeb 11, 2026 bywindreamer

Loading…

[WIP]: support glm4.7 with mtp WIP

#4346 openedFeb 10, 2026 byRunningLeon • Draft

Support MiniMax-M2 in TurboMind engine

#4343 openedFeb 10, 2026 byzh-nj

Loading…

Fix authorization Bug:P1

#4338 openedFeb 9, 2026 bylvhan028

Loading…

[WIP]Support torch compile

#4336 openedFeb 8, 2026 bygrimoire • Draft

add preliminary support for EP(single-node) of turbomind backend

#4332 openedFeb 6, 2026 byirexyc

Loading…

Qwen/Internlm/Llama Dense/Moe model fp8 quant online enhancement

New feature or request

#4324 openedFeb 5, 2026 by43758726

Loading…

Compatible with transformers 5.0 at TurboMind side improvement

#4304 openedJan 28, 2026 bylvhan028

Loading…

change ascend paged attention from BSH format to TND format for better performace

#4295 openedJan 27, 2026 byjinminxi104 • Draft

return BadRequest for all invlid inputs Bug:P2

#4291 openedJan 26, 2026 bylvhan028

Loading…

support repetition ngram logits processor enhancement

New feature or request

#4288 openedJan 23, 2026 bygrimoire

Loading…

fix dllm mask on set_step

#4278 openedJan 18, 2026 bygrimoire

Loading…

[ascend] fix awq and smoothq

#4277 openedJan 16, 2026 bywanfengcxz • Draft

Update benchmark serving script for proxy_server

#4173 openedDec 1, 2025 bylvhan028

Loading…

[WIP]: Support prefix caching with routed experts

#4171 openedNov 28, 2025 byRunningLeon • Draft

Support fp32 head for qwen and internlm models improvement

#4160 openedNov 27, 2025 byRunningLeon

Loading…

ProTip! Typegp on any issue or pull request to go back to the pull request listing page.

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Movatterモバイル変換

Pull requests: InternLM/lmdeploy

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Pull requests list