pub fn _mm512_castsi512_si128(a: __m512i) -> __m128i
Casts a vector of type __m512i to type __m128i, keeping the lower 128 bits of a. This intrinsic only changes the type seen by the compiler and does not generate any instructions, so it has zero latency.
Intel’s documentation