pub fn _mm256_permutexvar_epi16(idx: __m256i, a: __m256i) -> __m256i
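Shuffle 16-bit integers in a across lanes using the corresponding index in idx, and store the results in dst. Each element of dst is the element of a selected by the index at the same position in idx; only the low 4 bits of each 16-bit index are used, so every index addresses one of the 16 elements of a.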
Intel’s documentation
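
The selection rule can be illustrated with a scalar model. This is a minimal sketch of the semantics only, not the intrinsic itself or how it is implemented: the helper name permutexvar_epi16_model is hypothetical, and plain [u16; 16] arrays stand in for the __m256i operands, with array element 0 assumed to be the lowest 16-bit element of the vector.

```rust
/// Scalar model of a cross-lane permute over sixteen 16-bit elements.
/// `permutexvar_epi16_model` is a hypothetical helper for illustration;
/// it is not part of `core::arch`.
pub fn permutexvar_epi16_model(idx: [u16; 16], a: [u16; 16]) -> [u16; 16] {
    let mut dst = [0u16; 16];
    for i in 0..16 {
        // Only the low 4 bits of each index are used, giving a source
        // position in 0..=15; higher bits are ignored.
        let src = (idx[i] & 0x0F) as usize;
        dst[i] = a[src];
    }
    dst
}
```

With idx holding 15, 14, ..., 0, the model returns a reversed. An index of 16 behaves like an index of 0, because the bits above the low 4 are discarded.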