I am working on SSE performance optimization on an Intel Mac. I found that the SSE4.1 version of my code performs worse when built with Xcode 12.4 than with Xcode 10.1, so I checked the generated assembly. With Xcode 12.4, a single _mm_mul_epi() call is translated into three pmuludq instructions, which is an SSE2 instruction. With Xcode 10.1 the same call compiled as expected and _mm_mul_epi() was translated into pmuldq. Does anyone know how to fix this issue?
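For anyone who wants to compare the two toolchains, here is a minimal reproducer sketch. It assumes the intrinsic in question is `_mm_mul_epi32` (the SSE4.1 signed 32x32 -> 64-bit multiply that normally maps to pmuldq). The helper name `mul_even_lanes_i32` is hypothetical and only there for illustration; it is not taken from my real code.

```c
#include <stdint.h>
#include <smmintrin.h>  /* SSE4.1 intrinsics */

/* Hypothetical helper: multiplies the signed 32-bit values in lanes 0 and 2
   of a and b and stores the two signed 64-bit products in out.
   The target attribute enables SSE4.1 for this function only, so the file
   also builds without passing -msse4.1 on the command line. */
__attribute__((target("sse4.1")))
void mul_even_lanes_i32(const int32_t a[4], const int32_t b[4], int64_t out[2])
{
    __m128i va = _mm_loadu_si128((const __m128i *)a);
    __m128i vb = _mm_loadu_si128((const __m128i *)b);

    /* Assumption: this is the call that should become a single pmuldq. */
    __m128i prod = _mm_mul_epi32(va, vb);

    _mm_storeu_si128((__m128i *)out, prod);
}
```

Compiling this file with something like `clang -O2 -msse4.1 -S` under each Xcode version and searching the output for pmuldq versus pmuludq should show whether the difference reproduces outside the full project. If AVX is enabled, the VEX-encoded form vpmuldq may appear instead.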