pub fn _mm256_rol_epi64<const IMM8: i32>(a: __m256i) -> __m256i
Rotate the bits in each packed 64-bit integer in a to the left by the number of bits specified in imm8, and store the results in dst.
Intel’s documentation