
How does the performance of SageAttention compare to that of FA3 on Hopper GPUs? #59

Open
alexngng opened this issue Dec 3, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@alexngng

alexngng commented Dec 3, 2024

Did not see that comparison in the paper.

@jt-zhang
Member

jt-zhang commented Dec 3, 2024

SageAttention is currently designed to accelerate attention mechanisms across various GPUs, unlike FA3, which is optimized exclusively for and can only run on Hopper GPUs. We plan to optimize for Hopper in the future.
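For anyone who wants to measure the gap themselves while an official comparison is pending, below is a minimal timing sketch on an H100. It is not an official benchmark: the package names, call signatures (`sageattn` with `tensor_layout="HND"`, FA3's `flash_attn_func`) and the shapes are assumptions based on the two projects' READMEs and may differ across versions.

```python
# Rough per-call latency comparison of SageAttention vs. FA3 (sketch, not official).
import torch
from sageattention import sageattn                 # assumed: pip install sageattention
from flash_attn_interface import flash_attn_func   # assumed: FA3 "hopper" build of flash-attn

def bench(fn, iters=100, warmup=10):
    """Average milliseconds per call, measured with CUDA events."""
    for _ in range(warmup):
        fn()
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        fn()
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters

# Hypothetical shapes; adjust to the workload you care about.
batch, heads, seqlen, headdim = 4, 32, 4096, 128
q, k, v = (torch.randn(batch, heads, seqlen, headdim,
                       dtype=torch.float16, device="cuda") for _ in range(3))

# SageAttention: "HND" layout = [batch, heads, seq, dim] (per its README).
t_sage = bench(lambda: sageattn(q, k, v, tensor_layout="HND", is_causal=False))

# FA3 expects [batch, seq, heads, dim]; transpose outside the timed region.
q2, k2, v2 = (x.transpose(1, 2).contiguous() for x in (q, k, v))
t_fa3 = bench(lambda: flash_attn_func(q2, k2, v2, causal=False))

print(f"SageAttention: {t_sage:.3f} ms/iter   FA3: {t_fa3:.3f} ms/iter")
```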

@alexngng
Author

alexngng commented Dec 6, 2024

> SageAttention is currently designed to accelerate attention mechanisms across various GPUs, unlike FA3, which is optimized exclusively for and can only run on Hopper GPUs. We plan to optimize for Hopper in the future.

Thanks for the reply.

@jason-huang03 jason-huang03 added the enhancement New feature or request label Dec 11, 2024