Implements lane-local byte swapping through vector shuffles. While this is more setup than non-vector shuffles, this implementation can shuffle multiple integers concurrently. Signed-off-by: Andy Sadler <andrewsadler122@gmail.com>
11a0cceab9
1bbee3e217
CoverageInfoMethods