Commit 29e2d8f

achartier authored and dominicshanshan committed
[None][feat] Pass KvCacheRetentionConfig to torch LlmRequest (NVIDIA#8634)
Signed-off-by: Aurelien Chartier <2567591+achartier@users.noreply.github.com>
1 parent c471e4a, commit 29e2d8f

1 file changed: +2 -1 lines changed


‎tensorrt_llm/_torch/pyexecutor/llm_request.py‎

Lines changed: 2 additions & 1 deletion
@@ -764,7 +764,8 @@ def executor_request_to_llm_request(
         cache_salt_id=executor_request.cache_salt_id,
         arrival_time=getattr(executor_request, "py_arrival_time", None),
         py_multimodal_data=getattr(executor_request, "py_multimodal_data",
-                                   None))
+                                   None),
+        kv_cache_retention_config=executor_request.kv_cache_retention_config)
     if child_req_ids:
         for child_id in child_req_ids:
             llm_request.create_child_request(child_id)
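
The change above forwards one more field from the executor request to the torch LlmRequest. A minimal sketch of that pass-through pattern follows. Every class and function name in it (RetentionConfig, ExecutorRequest, TorchLlmRequest, convert) is a hypothetical stand-in written for illustration, not the TensorRT-LLM API, and the priority field is an assumed placeholder.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class RetentionConfig:
    """Hypothetical stand-in for a KV cache retention config."""
    priority: int = 0  # assumed placeholder field, not a real default


@dataclass
class ExecutorRequest:
    """Hypothetical stand-in for the executor-side request."""
    request_id: int
    kv_cache_retention_config: Optional[RetentionConfig] = None


@dataclass
class TorchLlmRequest:
    """Hypothetical stand-in for the torch-side request."""
    request_id: int
    py_multimodal_data: Optional[Any] = None
    kv_cache_retention_config: Optional[RetentionConfig] = None


def convert(executor_request: ExecutorRequest) -> TorchLlmRequest:
    """Copy fields across, including the retention config."""
    return TorchLlmRequest(
        request_id=executor_request.request_id,
        # Optional attribute: fall back to None when it is absent.
        py_multimodal_data=getattr(executor_request, "py_multimodal_data",
                                   None),
        # Direct access: the attribute is assumed to always exist,
        # although its value may still be None.
        kv_cache_retention_config=executor_request.kv_cache_retention_config,
    )


with_config = convert(ExecutorRequest(1, RetentionConfig(priority=5)))
without_config = convert(ExecutorRequest(2))
```

In this sketch, direct attribute access raises AttributeError if the source object lacks the attribute, whereas the getattr form with a default silently yields None. The diff uses direct access for kv_cache_retention_config, which suggests the field is expected to be present on every executor request.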

0 commit comments
