Experiment: treat the transfer of the whole buffer as a single operation that logically holds the CS pins asserted for its entire duration. This removes the gaps between successive 16-bit transfers.
http://www.pjrc.com/teensy/td_libs_SPI.html
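
A minimal host-side sketch of the idea, not the actual change: it contrasts toggling CS around every 16-bit word with holding CS across the whole buffer. MockBus, sendPerWord and sendHeld are hypothetical names invented for this illustration; MockBus only counts CS assertions and records the words sent, standing in for the real pin and SPI calls on the board.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the CS pin and SPI bus so the sketch runs on a host.
// On real hardware these would map to something like digitalWrite(csPin, LOW/HIGH)
// and SPI.transfer16(word); that mapping is an assumption, not the author's code.
struct MockBus {
    bool csAsserted = false;
    int csAssertCount = 0;          // times CS went from released to asserted
    std::vector<uint16_t> sent;     // words clocked out, in order

    void assertCS() {
        if (!csAsserted) {
            csAsserted = true;
            ++csAssertCount;
        }
    }
    void releaseCS() { csAsserted = false; }
    void transfer16(uint16_t word) {
        assert(csAsserted);         // a word may only go out while CS is held
        sent.push_back(word);
    }
};

// Per-word framing: CS is released after every 16-bit word,
// which is what leaves a gap between consecutive words.
void sendPerWord(MockBus& bus, const uint16_t* words, std::size_t count) {
    for (std::size_t i = 0; i < count; ++i) {
        bus.assertCS();
        bus.transfer16(words[i]);
        bus.releaseCS();
    }
}

// Whole-buffer framing: CS is asserted once, held for every word,
// and released only after the last word has been sent.
void sendHeld(MockBus& bus, const uint16_t* words, std::size_t count) {
    bus.assertCS();
    for (std::size_t i = 0; i < count; ++i) {
        bus.transfer16(words[i]);
    }
    bus.releaseCS();
}
```

With a four-word buffer, sendPerWord asserts CS four times while sendHeld asserts it once, and both send the same words in the same order.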