pub fn _mm_permutexvar_epi16(idx: __m128i, a: __m128i) -> __m128i
Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst.
Intel’s documentation