forked fromtriton-lang/triton
- Notifications
You must be signed in to change notification settings - Fork29
Tags: ROCm/triton
Tags
ifu-231117-2
Merge pull request#410 from ROCmSoftwarePlatform/ifu-231117Ifu 231117
ifu-231117-prev
add bitcode for gfx941 and gfx942 (#403)Co-authored-by: Aleksandr Efimov <130555951+alefimov-amd@users.noreply.github.com>
ifu-231108
Merge pull request#395 from ROCmSoftwarePlatform/ifu-231108Ifu 231108
ifu-231108-prev
[Tutorial] Fix post IFU issues with FA (#398)* [Tutorial] Fix post IFU issues with FA* Remove redundant kernels in 06-fused-attention.py* Added README for scripts in perf-kernels dir* Fix bwd kernel---------Co-authored-by: Lixun Zhang <lixun.zhang@amd.com>
ifu-231005
Merge pull request#382 from ROCmSoftwarePlatform/ifu231005-rebaseIfu231005
ifu-231005-prev
Add OptimizeEpilogue pass. (#346)* optimize_epilogue* Add config* Remove licenses* Comment out Hopper specific parameters when printing out configs* Add benchmark parameters from flash-attention repo* Add Z and H in the key of autotuner---------Co-authored-by: Lixun Zhang <lixun.zhang@amd.com>
third-party-merge-prev
use different int8 mfma instructions on different GPUs. (#368)* changes support to choose different int8 instructions* rename an instruction nameCo-authored-by: Aleksandr Efimov <efimov.alexander@gmail.com>
third-party-merge
Merge pull request#363 from ROCmSoftwarePlatform/post_ifu_rebase_emp……ty_kernel_worksThird Party Backend Merge
ifu-230908
Merge pull request#347 from ROCmSoftwarePlatform/ifu230908-2Ifu230908 2
ifu-230908-prev
[Stream] Fixed bug in stream-pipeline for FA (#345)* [Stream] Fixed bug in stream-pipeline for FA* updated gemm tutorial for num_stages=0* * updated configs
PreviousNext