pub fn _mm_mask_blend_ps(k: u8, a: __m128, b: __m128) -> __m128
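Blend packed single-precision (32-bit) floating-point elements from a and b using control mask k, and return the result (dst in Intel's notation). For each of the four lanes, the element is taken from b when the corresponding bit of k is set and from a otherwise; only the low four bits of k are used.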
Intel’s documentation
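
The per-lane selection can be illustrated with a portable scalar model. This is a minimal sketch, not the intrinsic: the function name mask_blend_ps_model is hypothetical, and plain [f32; 4] arrays stand in for __m128 so the code runs on any target, with index 0 as the lowest lane.

```rust
/// Hypothetical scalar model of the mask blend (an illustration only;
/// the real intrinsic operates on `__m128` values).
fn mask_blend_ps_model(k: u8, a: [f32; 4], b: [f32; 4]) -> [f32; 4] {
    // Start from `a`, then overwrite the lanes selected by the mask.
    let mut dst = a;
    for i in 0..4 {
        // Bit i of `k` set: lane i comes from `b`; clear: lane i stays from `a`.
        // Bits 4..=7 of `k` are never inspected.
        if (k >> i) & 1 == 1 {
            dst[i] = b[i];
        }
    }
    dst
}
```

Under this model, a mask of 0b0101 with a = [1.0, 2.0, 3.0, 4.0] and b = [10.0, 20.0, 30.0, 40.0] selects lanes 0 and 2 from b, giving [10.0, 2.0, 30.0, 4.0].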