Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

AWS GPU Configuration#868

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Draft
hameerabbasi wants to merge2 commits intomain
base:main
Choose a base branch
Loading
fromaws-gpu
Draft

AWS GPU Configuration#868

hameerabbasi wants to merge2 commits intomainfromaws-gpu

Conversation

hameerabbasi
Copy link
Collaborator

@hameerabbasihameerabbasi commentedMay 5, 2025
edited
Loading

This PR adds support for the following features (onlyCOO):

  • Elemwise (if supported by CuPy)
  • Sum along an axis
  • Matmul
  • Converting to/fromcupyx.scipy.sparse matrices.
  • to_device with"cpu" and CuPy devices, onlystream=None,
  • Constructing from CuPy arrays.

@hameerabbasihameerabbasiforce-pushed theaws-gpu branch 2 times, most recently fromb99e7a5 to185c956CompareMay 5, 2025 08:14
@codspeed-hqCodSpeed HQ
Copy link

codspeed-hqbot commentedMay 5, 2025
edited
Loading

CodSpeed Performance Report

Merging#868 willdegrade performances by 97.81%

Comparingaws-gpu (55ae3bc) withmain (afb5212)

Summary

⚡ 10 improvements
❌ 151 regressions
✅ 179 untouched benchmarks

⚠️Please fix the performance issues oracknowledge them on CodSpeed.

Benchmarks breakdown

BenchmarkBASEHEADChange
test_elemwise[side=100-rank=1-format='coo'-add]2.9 ms3.7 ms-21.3%
test_elemwise[side=100-rank=1-format='coo'-mul]2.2 ms2.7 ms-20.45%
test_elemwise[side=100-rank=1-format='gcxs'-add]3.4 ms4.6 ms-26.65%
test_elemwise[side=100-rank=1-format='gcxs'-mul]2.7 ms3.7 ms-27.45%
test_elemwise[side=100-rank=2-format='coo'-add]3.3 ms4.1 ms-19.78%
test_elemwise[side=100-rank=2-format='coo'-mul]2.4 ms2.9 ms-17.56%
test_elemwise[side=100-rank=2-format='gcxs'-add]6.7 ms7.7 ms-13.02%
test_elemwise[side=100-rank=2-format='gcxs'-mul]5.8 ms6.5 ms-10.95%
test_elemwise[side=1000-rank=1-format='coo'-add]2.9 ms3.8 ms-22.4%
test_elemwise[side=1000-rank=1-format='coo'-mul]2.2 ms2.7 ms-20.34%
test_elemwise[side=1000-rank=1-format='gcxs'-add]3.4 ms4.7 ms-27.39%
test_elemwise[side=1000-rank=1-format='gcxs'-mul]2.7 ms3.7 ms-27.25%
test_elemwise[side=500-rank=1-format='coo'-add]2.9 ms3.8 ms-22.45%
test_elemwise[side=500-rank=1-format='coo'-mul]2.2 ms2.7 ms-20.39%
test_elemwise[side=500-rank=1-format='gcxs'-add]3.4 ms4.7 ms-27.43%
test_elemwise[side=500-rank=1-format='gcxs'-mul]2.7 ms3.7 ms-27.29%
test_elemwise[side=500-rank=2-format='coo'-add]7.1 ms8 ms-10.33%
test_elemwise[side=500-rank=2-format='coo'-mul]3.9 ms4.4 ms-11.76%
test_elemwise_broadcast[side=100-format='coo'-mul]2.6 ms3.2 ms-18.38%
test_elemwise_broadcast[side=100-format='gcxs'-mul]6.4 ms7.4 ms-13.71%
...............

ℹ️Only the first 20 benchmarks are displayed.Go to the app to view all benchmarks.

@hameerabbasihameerabbasiforce-pushed theaws-gpu branch 4 times, most recently from4e69a90 toae8e04fCompareMay 7, 2025 11:51
Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment
Reviewers
No reviews
Assignees
No one assigned
Labels
None yet
Projects
None yet
Milestone
No milestone
Development

Successfully merging this pull request may close these issues.

1 participant
@hameerabbasi

[8]ページ先頭

©2009-2025 Movatter.jp