pub fn _mm512_castps512_ps256(a: __m512) -> __m256
Cast vector of type __m512 to type __m256. This intrinsic is only used for compilation and does not generate any instructions, thus it has zero latency.
Intel’s documentation