
Mi50 Support #29

Open
YehowshuaScaled opened this issue Dec 31, 2023 · 5 comments

Comments

@YehowshuaScaled

I was able to build flash-attention ROCm for both my Mi100 and Mi50 cards, but only got flash attention working on the Mi100 (very impressive performance, I might add).

Trying to run flash attention on the Mi50 delivered the following error:
RuntimeError: DeviceGroupedMultiheadAttentionForward_Xdl_CShuffle_V2<256, 128, 128, 32, 8, 8, 128, 128, 32, 2, Default, ASpecDefault, B0SpecDefault, B1SpecDefault, CSpecDefault, MaskUpperTriangleFromTopLeft> does not support this problem

How hard would it be to port FA to the Mi50? I'm happy to pay/hire for support on this, as I have a rather large stockpile of Mi50s.
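
For reference, a minimal sketch of the kind of call that triggers this (assuming the build exposes the usual flash_attn_func interface; the shapes and dtype below are illustrative):

```python
# Minimal sketch (illustrative shapes/dtype) exercising the flash-attention
# forward pass via the standard flash_attn_func interface.
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 1024, 16, 64
q = torch.randn(batch, seqlen, nheads, headdim, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)

# Runs on the Mi100 (gfx908); on the Mi50 (gfx906) the CK backend raises the
# "does not support this problem" RuntimeError shown above.
out = flash_attn_func(q, k, v, causal=True)
print(out.shape)  # (batch, seqlen, nheads, headdim)
```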

@dejay-vu

dejay-vu commented Jan 23, 2024

Hi @YehowshuaScaled. I think it would be better to ask the CK team to see if they are going to support MI50. It won't be an issue if they have FA kernels running on MI50.

RuntimeError: DeviceGroupedMultiheadAttentionForward_Xdl_CShuffle_V2<256, 128, 128, 32, 8, 8, 128, 128, 32, 2, Default, ASpecDefault, B0SpecDefault, B1SpecDefault, CSpecDefault, MaskUpperTriangleFromTopLeft> does not support this problem

This error is actually raised from the CK backend.

@differentprogramming

I noticed this line in setup.py:
allowed_archs = ["native", "gfx90a", "gfx908", "gfx940", "gfx941", "gfx942"]
I'm sad that gfx906 isn't there, since I have an MI50 as well.
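
For anyone tempted to experiment, the obvious (and unsupported) thing to try is adding gfx906 to that list, roughly as sketched below. This is purely speculative: even if the extension then builds, the CK kernels it dispatches to may still reject gfx906 at runtime, as the error above suggests.

```python
# Sketch only, not a tested patch: let gfx906 pass the arch check in setup.py.
# The underlying CK kernels may still not target gfx906, in which case the
# build either fails or raises the same "does not support this problem" error.
allowed_archs = ["native", "gfx906", "gfx90a", "gfx908", "gfx940", "gfx941", "gfx942"]
```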

@linchen111

I was able to build flash-attention ROCm for both my Mi100 and Mi50 cards, but only got flash attention working on the Mi100 (very impressive performance, I might add).

Trying to run flash attention on the Mi50 delivered the following error: RuntimeError: DeviceGroupedMultiheadAttentionForward_Xdl_CShuffle_V2<256, 128, 128, 32, 8, 8, 128, 128, 32, 2, Default, ASpecDefault, B0SpecDefault, B1SpecDefault, CSpecDefault, MaskUpperTriangleFromTopLeft> does not support this problem

How hard would it be to port FA to the Mi50? Happy to pay/hire for support on this as I have a rather large stockpile of Mi50s.

Did you solve this?

@Said-Akbar

Hello @YehowshuaScaled,

Did you find a solution for it? I have 2x MI60 cards.

@Said-Akbar

Said-Akbar commented Oct 31, 2024

Hi @jayz0123,

How hard is it to implement FA kernels for the MI60? Can you please point to the relevant scripts and documentation for making the changes? What knowledge is required to bring FA2 to the MI60? Does it depend only on Composable Kernel repo support?
A quick look at the CK repo shows references to gfx906 (MI60): https://github.com/search?q=repo%3AROCm%2Fcomposable_kernel%20gfx906&type=code
However, the official docs state they only support gfx908 and up: https://rocm.docs.amd.com/projects/composable_kernel/en/latest/tutorial/tutorial_hello_world.html#hardware-targets
I want to explore this and make the changes myself if it is not very complex (and does not require deep architecture knowledge).
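
As a first sanity check before touching CK, it may be worth confirming what architecture string the ROCm build of PyTorch actually reports for the MI60. A small sketch (gcnArchName is present on recent ROCm builds of PyTorch; the fallback handles builds without it):

```python
# Sketch: print the GPU architecture string as seen by PyTorch on ROCm.
# An MI50/MI60 should report something like "gfx906:sramecc+:xnack-".
import torch

for i in range(torch.cuda.device_count()):
    props = torch.cuda.get_device_properties(i)
    arch = getattr(props, "gcnArchName", None)  # present on recent ROCm builds
    print(f"device {i}: {props.name} | arch: {arch or 'unknown (no gcnArchName)'}")
```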
