FFT c2c : interleaved in\out vs. non-interleaved #291

zviered · 2023-04-26T16:23:18Z

Hello,

I'm running a signal processing algorithm on a Cortex-A53.
The code is written in C with NEON intrinsics.

I measured the performance of two operations: matrix multiplication of a complex matrix by a scalar matrix, and scalar multiplication of a complex float vector by a complex float vector.
It seems that when the input/output is interleaved (re0, im0, re1, im1, ...), performance is lower than with non-interleaved input/output.
For the interleaved case I'm using: vld2q_f32, vst2q_f32
For the non-interleaved case: vld1q_f32, vst1q_f32
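
To make the two layouts concrete, here is a minimal sketch of an element-wise complex multiply written both ways. It is not the code that was measured: the function names `cmul_interleaved` and `cmul_split` are hypothetical, and the scalar fallback is included only so the example also builds on machines without NEON. It shows where the de-interleaving load/store (vld2q_f32 / vst2q_f32) sits compared to the plain ones (vld1q_f32 / vst1q_f32), and makes no claim about timing.

```c
#include <stddef.h>

#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#define HAVE_NEON 1
#else
#define HAVE_NEON 0
#endif

/* Interleaved layout: {re0, im0, re1, im1, ...}; n = number of complex elements. */
void cmul_interleaved(const float *a, const float *b, float *out, size_t n)
{
    size_t i = 0;
#if HAVE_NEON
    for (; i + 4 <= n; i += 4) {
        /* vld2q_f32 de-interleaves: val[0] holds 4 real parts, val[1] holds 4 imaginary parts. */
        float32x4x2_t va = vld2q_f32(a + 2 * i);
        float32x4x2_t vb = vld2q_f32(b + 2 * i);
        float32x4x2_t vr;
        /* re = ar*br - ai*bi */
        vr.val[0] = vmlsq_f32(vmulq_f32(va.val[0], vb.val[0]), va.val[1], vb.val[1]);
        /* im = ar*bi + ai*br */
        vr.val[1] = vmlaq_f32(vmulq_f32(va.val[0], vb.val[1]), va.val[1], vb.val[0]);
        /* vst2q_f32 re-interleaves on the way out. */
        vst2q_f32(out + 2 * i, vr);
    }
#endif
    /* Scalar tail (and full fallback when NEON is unavailable). */
    for (; i < n; ++i) {
        float ar = a[2 * i], ai = a[2 * i + 1];
        float br = b[2 * i], bi = b[2 * i + 1];
        out[2 * i]     = ar * br - ai * bi;
        out[2 * i + 1] = ar * bi + ai * br;
    }
}

/* Non-interleaved (split) layout: separate arrays for real and imaginary parts. */
void cmul_split(const float *are, const float *aim,
                const float *bre, const float *bim,
                float *ore, float *oim, size_t n)
{
    size_t i = 0;
#if HAVE_NEON
    for (; i + 4 <= n; i += 4) {
        /* Plain contiguous loads: no de-interleaving needed. */
        float32x4_t ar = vld1q_f32(are + i), ai = vld1q_f32(aim + i);
        float32x4_t br = vld1q_f32(bre + i), bi = vld1q_f32(bim + i);
        vst1q_f32(ore + i, vmlsq_f32(vmulq_f32(ar, br), ai, bi));
        vst1q_f32(oim + i, vmlaq_f32(vmulq_f32(ar, bi), ai, br));
    }
#endif
    for (; i < n; ++i) {
        ore[i] = are[i] * bre[i] - aim[i] * bim[i];
        oim[i] = are[i] * bim[i] + aim[i] * bre[i];
    }
}
```

The arithmetic is identical in both functions; the only difference is the load/store pair, which is the part the comparison above is about.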

Do you think it makes sense to create a c2c FFT that accepts non-interleaved input?

Thank you,
Zvika
