Can you give me a few more details on the issues you're seeing with the ws2811, the teensy 3, and fastspi? I'm currently working on getting the ws2811 timings solidified. The lack of solid docs is a pain, but I'm narrowing things down.
Also, another thing to keep in mind: right now, the ws2811 code in both versions of the library disables interrupts, which means that, among other things, the clock stops running. So if you are using micros/millis to determine when to push frames, your timings are going to be off. I'm working on making the library kick the timers forward by the appropriate amount, but I won't have that done for a little bit yet (my most immediate focus at the moment is cleaning up the teensy 3 SPI code and adding DMA).
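To get a feel for how far off the clock can drift, here is a rough sketch of the arithmetic. It is not the library's code, and the names (ws2811FrameMicros, CompensatedClock) are made up for illustration. It assumes 800 kHz ws2811 signalling, so 1.25 us per bit and 24 bits per LED, and it treats the whole frame time as lost. That is an upper bound, since how many ticks actually go missing depends on how the timer hardware handles pending interrupts.

```cpp
#include <cstdint>

// Assumption: 800 kHz WS2811 signalling, i.e. 1250 ns per bit, 24 bits per LED.
constexpr uint32_t kNanosPerBit = 1250;
constexpr uint32_t kBitsPerLed = 24;

// Approximate time in microseconds that interrupts stay disabled while one
// frame is written out. Hypothetical helper, not part of the library.
// Note: the 32-bit intermediate overflows past roughly 143,000 LEDs.
constexpr uint32_t ws2811FrameMicros(uint32_t numLeds) {
    return (numLeds * kBitsPerLed * kNanosPerBit) / 1000;
}

// Hypothetical software-side correction: remember how much time was
// (at most) lost during frame writes and add it back to the raw reading.
struct CompensatedClock {
    uint32_t lostMicros = 0;

    // Call once after each frame has been pushed out.
    void frameSent(uint32_t numLeds) {
        lostMicros += ws2811FrameMicros(numLeds);
    }

    // rawMicros is whatever micros() returned; the result is the estimate
    // of what it would have returned had the clock kept running.
    uint32_t correct(uint32_t rawMicros) const {
        return rawMicros + lostMicros;
    }
};
```

With these assumptions, 100 LEDs cost about 3 ms per frame, so at 30 frames per second the clock could fall behind by up to roughly 90 ms every second. In a sketch you would call frameSent() right after each show and use correct(micros()) when deciding when to push the next frame.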