ROCm support #295


Open
fxmarty wants to merge 21 commits into main from rocm-support


@fxmarty (Contributor) commented Jun 19, 2024 (edited)

As per the title: support AMD GPUs through the TEI backend.

For now, only embedding models with CLS/mean pooling have been tested.

MI210/MI250/MI300 can dispatch to CK flash attention 2, but other GPUs will default to a manual attention implementation (or SDPA). Only BERT appears to be supported in the Python backend.
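
As a rough illustration of the dispatch rule described above (not taken from the PR's code), the choice of attention path could be sketched as follows. The function name, the returned labels, and the idea of matching on a device-name string are all assumptions made for this example; the actual backend may detect the GPU differently, for instance by architecture identifier.

```python
# Hypothetical sketch of the attention dispatch described in this PR.
# The GPU families come from the PR description; everything else
# (names, labels, string matching) is an assumption for illustration.
FLASH_ATTENTION_CK_GPUS = ("MI210", "MI250", "MI300")


def select_attention_backend(device_name: str, sdpa_available: bool = True) -> str:
    """Return a label for the attention implementation to use."""
    name = device_name.upper()
    # MI210/MI250/MI300 can use the CK flash attention 2 kernels.
    if any(tag in name for tag in FLASH_ATTENTION_CK_GPUS):
        return "flash_attention_2_ck"
    # Any other GPU falls back to SDPA when available, else manual attention.
    return "sdpa" if sdpa_available else "manual"
```

With this sketch, a device reported as "AMD Instinct MI250X" would take the CK flash attention path, while any other device name would fall back to SDPA or the manual implementation.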

@fxmarty mentioned this pull request Jun 19, 2024
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@OlivierDehaene (Contributor) left a comment

Thanks!

@@ -0,0 +1,34 @@
## Testing

I think this is equivalent to the integration snapshotting logic in router/tests?

model = AutoModel.from_pretrained(model_path).to(dtype).to(device)
self.hidden_size = model.config.hidden_size
self.pooling_mode = pooling_mode

Can you add mean pooling / CLS pooling forking L46?

@fxmarty (Contributor, Author) replied:

CLS pooling was already there, I added mean pooling.

What is L46?
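
For readers unfamiliar with the two pooling modes discussed in this thread, here is a minimal dependency-free sketch. It is not the PR's implementation, which operates on torch tensors; the function names, the "cls"/"mean" mode strings, and the use of plain Python lists are assumptions made for illustration.

```python
from typing import List, Sequence

Vector = List[float]


def cls_pool(token_embeddings: Sequence[Vector]) -> Vector:
    """CLS pooling: use the embedding of the first token."""
    return list(token_embeddings[0])


def mean_pool(token_embeddings: Sequence[Vector], attention_mask: Sequence[int]) -> Vector:
    """Mean pooling: average the embeddings of non-padding tokens only."""
    kept = [vec for vec, keep in zip(token_embeddings, attention_mask) if keep]
    if not kept:
        raise ValueError("attention_mask masks out every token")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


def pool(token_embeddings: Sequence[Vector], attention_mask: Sequence[int], pooling_mode: str) -> Vector:
    """Dispatch on a hypothetical pooling_mode string ("cls" or "mean")."""
    if pooling_mode == "cls":
        return cls_pool(token_embeddings)
    if pooling_mode == "mean":
        return mean_pool(token_embeddings, attention_mask)
    raise ValueError(f"unsupported pooling mode: {pooling_mode}")
```

The attention mask matters for mean pooling: padding tokens would otherwise skew the average for shorter sequences in a batch.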


@OlivierDehaene, I created a PR to fix this

@almersawi

any chance this MR is merged soon?

@fxmarty (Contributor, Author)

@almersawi it is in good shape to me.

cc @OlivierDehaene

@baddoub commented Jul 19, 2024 (edited)

Hey guys! Thanks for this MR. We have been waiting for it for some time now. Any idea when this MR will be merged?

@ocnimesh

Can anyone please provide build steps for the Docker image with this branch?
I am getting the error below. Are there any prerequisite packages?

/opt/conda/lib/python3.10/site-packages/torch/include/ATen/hip/HIPContextLight.h:20:10: fatal error: 'hipsolver/hipsolver.h' file not found
...
...
Dockerfile-rocm:112
112 | >>> RUN make -f Makefile-flash-att-v2 install-flash-attention-v2-rocm
...
...
ERROR: failed to solve: process "/bin/sh -c make -f Makefile-flash-att-v2 install-flash-attention-v2-rocm" did not complete successfully: exit code: 2

Command I executed: sudo docker build -f Dockerfile-rocm -t nims123/tei_amd .
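
One way to narrow down an error like this is to check whether the missing header exists anywhere in the build environment. The sketch below is a hypothetical diagnostic, not part of this PR: the helper name is made up, and `/opt/rocm/include` is only an assumed default location for ROCm headers that may not match the base image used by Dockerfile-rocm.

```python
from pathlib import Path
from typing import Iterable, Optional

# Assumed search location; adjust for the actual base image.
DEFAULT_INCLUDE_DIRS = ("/opt/rocm/include",)


def find_header(relative_path: str, include_dirs: Iterable[str] = DEFAULT_INCLUDE_DIRS) -> Optional[Path]:
    """Return the first existing path to the header, or None if it is absent."""
    for directory in include_dirs:
        candidate = Path(directory) / relative_path
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    header = "hipsolver/hipsolver.h"
    found = find_header(header)
    print(f"{header}: {found if found else 'not found in the searched directories'}")
```

If the header is reported as missing, the hipSOLVER development files are probably not installed in the image, which would be consistent with the compiler error above.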

@nbroad1881 mentioned this pull request Sep 10, 2024
@mht-sharma

Applied minor fixes to successfully build the Docker image. PR #403

@nbroad1881 @OlivierDehaene

@sauravsit

any ETA on this merge?

@nbroad1881

@OlivierDehaene, @mht-sharma,

Is there anything that I can do to help unblock this?

Co-authored-by: root <root@tw022.pit.tensorwave.lan>
@almersawi

any updates?

Reviewers

@nbroad1881 left review comments

@OlivierDehaene: awaiting requested review

At least 1 approving review is required to merge this pull request.


9 participants
@fxmarty @HuggingFaceDocBuilderDev @almersawi @baddoub @ocnimesh @mht-sharma @sauravsit @nbroad1881 @OlivierDehaene
