Note that the same SdFat library is used with Teensyduino.
Using DMA still requires waiting for the read to complete. I'm not sure whether SdFat makes it easy to do this from outside the library.
I know ELM CHaN's...