I'm not sure whether GCC (Arduino) uses the DSP instructions, or uses them by default, compared with other compilers like IAR or Keil.
If you use floating point, the current Teensys don't have that ARM feature (hardware floating point). The next one will. It may not matter to your app.
Don't most such transforms use a lot of transcendental functions?
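
One way to settle the DSP and floating-point questions for a given toolchain is to ask the compiler itself. The sketch below is only an illustration: it assumes the compiler follows the ARM C Language Extensions (ACLE) and predefines `__ARM_FEATURE_DSP` and `__ARM_FP` when those features are targeted, which GCC for ARM generally does. The function names are made up for this example, and whether the Arduino build actually passes flags that turn these on is something I have not checked.

```c
/* Compile-time check of what the compiler is targeting.
   Assumption: the toolchain predefines the ACLE macros below.
   Function names are hypothetical, chosen for this example. */

/* Returns 1 if the build targets the ARM DSP extension, else 0. */
int has_dsp_extension(void) {
#if defined(__ARM_FEATURE_DSP) && (__ARM_FEATURE_DSP == 1)
    return 1;
#else
    return 0;
#endif
}

/* Returns 1 if the build targets hardware floating point, else 0.
   ACLE defines __ARM_FP as a bitmask of supported precisions. */
int has_hardware_fp(void) {
#if defined(__ARM_FP)
    return 1;
#else
    return 0;
#endif
}
```

In an Arduino sketch you could call both from `setup()` and print the results over `Serial`. Note this only tells you what the compiler was told to target, not whether it actually emits DSP instructions for your particular code; for that you would have to look at the generated assembly.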