Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

[WIP] Reduce the number of fa rows for Intel#18138

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Draft
mmerecki wants to merge1 commit intoggml-org:master
base:master
Choose a base branch
Loading
frommmerecki:reduce-fa-num-rows

Conversation

@mmerecki
Copy link

Reduce the number of fa rows for Intel to reduce registers usage.

@mmereckimmerecki changed the titleReduce the number of fa rows for Intel[WIP] Reduce the number of fa rows for IntelDec 17, 2025
@jeffbolznv
Copy link
Collaborator

Should this depend on head size? Some models have small head sizes like 64 or even 40, 2 rows seems pretty small for that. But if 2 is best, I don't object.

@github-actionsgithub-actionsbot added VulkanIssues specific to the Vulkan backend ggmlchanges relating to the ggml tensor library for machine learning labelsDec 17, 2025
@mmerecki
Copy link
Author

Thanks Jeff. I will verify this change with more models and potentially update the value for small head sizes.
I will also add information about the test results before I make the PR ready for review.

Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment

Reviewers

@0cc4m0cc4mAwaiting requested review from 0cc4m0cc4m will be requested when the pull request is marked ready for review0cc4m is a code owner

At least 1 approving review is required to merge this pull request.

Assignees

No one assigned

Labels

ggmlchanges relating to the ggml tensor library for machine learningVulkanIssues specific to the Vulkan backend

Projects

None yet

Milestone

No milestone

Development

Successfully merging this pull request may close these issues.

2 participants

@mmerecki@jeffbolznv

[8]ページ先頭

©2009-2025 Movatter.jp