_mm256_permutevar8x32_epi32
Permutes doubleword elements of the source vector into the destination vector. The corresponding Intel® AVX2 instruction is VPERMD.
_mm256_permutevar8x32_ps
Permutes single-precision floating-point elements of the source vector into the destination vector. The corresponding Intel® AVX2 instruction is VPERMPS.
_mm256_permute4x64_epi64
Permutes quadword integer values of the source vector into the destination vector. The corresponding Intel® AVX2 instruction is VPERMQ.
_mm256_permute4x64_pd
Permutes quadword double-precision floating-point values of the source vector into the destination vector. The corresponding Intel® AVX2 instruction is VPERMPD.
_mm256_permute2x128_si256
Permutes 128-bit integer data from the first source vector and the second source vector in the destination vector. The corresponding Intel® AVX2 instruction is VPERM2I128.