Movatterモバイル変換

NotificationsYou must be signed in to change notification settings
Fork29
Star113

ifu-231117-2

Toggle ifu-231117-2's commit message

Merge pull request#410 from ROCmSoftwarePlatform/ifu-231117Ifu 231117

Dec 15, 2023
29847e9
zip
tar.gz

ifu-231117-prev

Toggle ifu-231117-prev's commit message

add bitcode for gfx941 and gfx942 (#403)Co-authored-by: Aleksandr Efimov <130555951+alefimov-amd@users.noreply.github.com>

Dec 14, 2023
521f425
zip
tar.gz

ifu-231108

Toggle ifu-231108's commit message

Merge pull request#395 from ROCmSoftwarePlatform/ifu-231108Ifu 231108

Nov 17, 2023
e1513b3
zip
tar.gz

ifu-231108-prev

Toggle ifu-231108-prev's commit message

[Tutorial] Fix post IFU issues with FA (#398)* [Tutorial] Fix post IFU issues with FA* Remove redundant kernels in 06-fused-attention.py* Added README for scripts in perf-kernels dir* Fix bwd kernel---------Co-authored-by: Lixun Zhang <lixun.zhang@amd.com>

Nov 14, 2023
5b06b16
zip
tar.gz

ifu-231005

Toggle ifu-231005's commit message

Merge pull request#382 from ROCmSoftwarePlatform/ifu231005-rebaseIfu231005

Nov 7, 2023
3c1fe61
zip
tar.gz

ifu-231005-prev

Toggle ifu-231005-prev's commit message

Add OptimizeEpilogue pass. (#346)* optimize_epilogue* Add config* Remove licenses* Comment out Hopper specific parameters when printing out configs* Add benchmark parameters from flash-attention repo* Add Z and H in the key of autotuner---------Co-authored-by: Lixun Zhang <lixun.zhang@amd.com>

Nov 3, 2023
c65f1e6
zip
tar.gz

third-party-merge-prev

Toggle third-party-merge-prev's commit message

use different int8 mfma instructions on different GPUs. (#368)* changes support to choose different int8 instructions* rename an instruction nameCo-authored-by: Aleksandr Efimov <efimov.alexander@gmail.com>

Oct 26, 2023
2729ae6
zip
tar.gz

third-party-merge

Toggle third-party-merge's commit message

Merge pull request#363 from ROCmSoftwarePlatform/post_ifu_rebase_emp……ty_kernel_worksThird Party Backend Merge

Oct 26, 2023
26debc9
zip
tar.gz

ifu-230908

Toggle ifu-230908's commit message

Merge pull request#347 from ROCmSoftwarePlatform/ifu230908-2Ifu230908 2

Oct 5, 2023
be95edc
zip
tar.gz

ifu-230908-prev

Toggle ifu-230908-prev's commit message

[Stream] Fixed bug in stream-pipeline for FA (#345)* [Stream] Fixed bug in stream-pipeline for FA* updated gemm tutorial for num_stages=0* * updated configs

Sep 30, 2023
287b0ad
zip
tar.gz

PreviousNext

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ifu-231117-2

Verified

ifu-231117-prev

Verified

ifu-231108

Verified

ifu-231108-prev

Verified

ifu-231005

Verified

ifu-231005-prev

Verified

third-party-merge-prev

Verified

third-party-merge

Verified

ifu-230908

Verified

ifu-230908-prev

Verified

Movatterモバイル変換

Tags: ROCm/triton