Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

[Feature]: Detailed Usage Guide for AFD Support in TensorRT-LLM #9489

Open
Assignees
laikhtewari
Labels
Doc<NV>TRTLLM's textual/illustrative materials: API refs, guides, tutorials. Improvement & clarity.feature requestNew feature or request. This includes new model, dtype, functionality support
@Alan-D-Chen

Description

@Alan-D-Chen

🚀 The feature, motivation and pitch

First, we would like to express our gratitude to the engineers of TensorRT-LLM. They have brought tremendous surprises to the community and driven significant progress in both science and industry. However, AFD (Attention-FFN Disaggregation), which has garnered substantial attention within the community recently, is not yet supported by TensorRT-LLM.

I have carefully reviewed the TensorRT-LLM documentation and paid particular attention to relevant issues, arriving at the following conclusions:
The TensorRT-LLM framework does not yet fully support a mature and usable AFD (Attention-FFN Disaggregation) feature. However, experimental implementations for specific models already exist and are in the development and iteration phase. Key issues still need to be addressed before its official deployment.

We hope the community will extend more warmth and encouragement to the TensorRT-LLM developers to accelerate the realization of the AFD feature. It is my wish that this post can serve as a centralized hub for one-stop solutions related to the AFD feature, making it easier for everyone to access relevant resources. Finally, we wish the developers good health and smooth work.

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and checked thedocumentation andexamples for answers to frequently asked questions.

Metadata

Metadata

Assignees

Labels

Doc<NV>TRTLLM's textual/illustrative materials: API refs, guides, tutorials. Improvement & clarity.feature requestNew feature or request. This includes new model, dtype, functionality support

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions


    [8]ページ先頭

    ©2009-2025 Movatter.jp