Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Update sdpa function with enable_gqa=True#191

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Open
jainapurva wants to merge2 commits intomain
base:main
Choose a base branch
Loading
fromgqa_support

Conversation

@jainapurva
Copy link

For the llama model, in the sdpa function call, set enable_gqa=True to use the inbuilt grouped query attention functionality

@facebook-github-botfacebook-github-bot added the CLA SignedThis label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. labelJul 13, 2024
@jainapurvajainapurva requested a review fromdrisspgJuly 13, 2024 03:56
@yanboliang
Copy link
Contributor

yanboliang commentedJul 26, 2024
edited
Loading

I think we should wait a bit to get this in, since a lot of users are still using the old version of PT which doesn't supportenable_gqa. But I'm interested how much perf gain we have after enabling the builtin gqa, do you have numbers on A100?

drisspg reacted with thumbs up emoji

Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment

Reviewers

@drisspgdrisspgAwaiting requested review from drisspg

Assignees

No one assigned

Labels

CLA SignedThis label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Milestone

No milestone

Development

Successfully merging this pull request may close these issues.

4 participants

@jainapurva@yanboliang@facebook-github-bot

[8]ページ先頭

©2009-2025 Movatter.jp