The CS pins on T4.x are not like 3.x in that is you select one by index, not one or more by mask. So changed returned values to be index (1 based) Also while testing I pulled in Frank's change for adding delay after CS pin is logically selected.
http://www.pjrc.com/teensy/td_libs_SPI.html