Saturday, July 15, 2023

SIMD - transpose

 Intel SIMD (AVX) transpose intrinsics


Intel® Intrinsics Guide

- _shuffle_* (mask)

- permute (mask)

- _unpacklo_, _unpackhi_

- _MM_TRANSPOSE4_PS & c++ - Fastest way to transpose 4x4 byte matrix - Stack Overflow


Example

https://godbolt.org/z/xzfPqnrdo