pub fn _mm256_permutexvar_ps(idx: __m256i, a: __m256) -> __m256
Shuffle single-precision (32-bit) floating-point elements in a across lanes using the corresponding index in idx.
Intel’s documentation