TEST: Rework unit tests
All outputs generated from the MNI152 are identical across the three platform, so there is no need to store per-platform benchmark data. For the other tests, use the same RNG on all platforms to ensure that results are consistent, and evaluate results against equivalent operations in Python.