pub fn _mm256_permutex_pd<const MASK: i32>(a: __m256d) -> __m256d
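Shuffles the double-precision (64-bit) floating-point elements of a within the 256-bit lane using the control in MASK (called imm8 in Intel's documentation), and returns the result (dst in Intel's documentation). Each 2-bit field of the control selects a source element: bits [2i+1:2i] of MASK give the index of the element of a that is copied to element i of the result.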
Intel’s documentation
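
The selection rule can be illustrated with a scalar model. This is a minimal sketch, not the intrinsic itself: `permutex_pd_model` is a hypothetical helper that works on a plain `[f64; 4]` (element 0 first) so that it runs on any target, and it assumes the usual reading of the control in which only the low 8 bits of `MASK` are used.

```rust
/// Scalar reference model of the shuffle (hypothetical helper, not part of
/// `core::arch`). Element `i` of the result is `a[(MASK >> 2*i) & 0b11]`.
fn permutex_pd_model<const MASK: i32>(a: [f64; 4]) -> [f64; 4] {
    let mut dst = [0.0_f64; 4];
    for i in 0..4 {
        // Extract the 2-bit selector for destination element `i`.
        let src = ((MASK >> (2 * i)) & 0b11) as usize;
        dst[i] = a[src];
    }
    dst
}

fn main() {
    let a = [10.0, 11.0, 12.0, 13.0];

    // Selectors, low field first: 3, 2, 1, 0 -> the elements are reversed.
    let reversed = permutex_pd_model::<0b00_01_10_11>(a);
    assert_eq!(reversed, [13.0, 12.0, 11.0, 10.0]);

    // Every selector is 2 -> element 2 is broadcast to all four positions.
    let broadcast = permutex_pd_model::<0b10_10_10_10>(a);
    assert_eq!(broadcast, [12.0, 12.0, 12.0, 12.0]);

    // Selectors 0, 1, 2, 3 -> the identity permutation.
    let identity = permutex_pd_model::<0b11_10_01_00>(a);
    assert_eq!(identity, a);
}
```

On hardware that supports the instruction, calling `_mm256_permutex_pd::<0b00_01_10_11>(a)` on a vector holding the same four values should produce the same reversed ordering as the model.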