pub fn _mm512_castpd512_pd128(a: __m512d) -> __m128d
Cast vector of type __m512d to type __m128d. This intrinsic is only used for compilation and does not generate any instructions, thus it has zero latency.
Intel’s documentation