Apple Clang generating incorrect SIMD code

Question

Created Dec ’24

Replies 4

Boosts 2

Participants 3

I have an M2 Mac Mini with Apple Clang 16.0.0. Under certain circumstances, the SIMD code generated by an unrolled loop is incorrect.

I have a short example program which reproduces the bug, on my machine and someone else's with the same Clang version. The core operation is this:

	for (size_t i = 0; i < count; ++i) {
		c[i] = a[i]*std::conj(b[i]);
	}

This loop gets unrolled to process 4 elements at once, and when count=15, the first 12 results have the wrong sign for the imaginary part. The final 3 elements are correct, since those are processed in a different code path.

Is this an known error? I suspect it might be present in other Apple Clang versions as well (because I found this while chasing down an extremely unpredictable bug) but so far this is the only setup where I've cleanly reproduced it.

Minimal test program (43 lines): https://signalsmith-audio.co.uk/tmp/argh.git/ - just run make.

The expected output is a bunch of error=0, or small values from floating-point errors.

I'm getting results like error=0.229711, and you can see it's because the "actual" results have a ± error.

Boost

Answer 1

endecotp OP

Dec ’24

Possibly related? :

https://developer.apple.com/forums/thread/766718 https://developer.apple.com/forums/thread/766030

2

Answer 2

UrsaDSP OP

Dec ’24

To confirm, I can reproduce this on another machine with a different processor and have tested using different compilers.

Apple clang 15.0.0 reports no errors clang version 19.1.5 (via llvm via brew install llvm) reports tiny errors as you might expect with relaxed float compliance. Apple clang 16.0.0 is wildly different (imaginary portion have their sign flipped).

Would be super nice if Apple could comment on this and what the timeline might be for a fix.

Thanks!

1

Answer 3

endecotp OP

Dec ’24

The first link I posted suggests this; did you try it?

-mllvm -enable-constraint-elimination=0

0

Answer 4

UrsaDSP OP

Dec ’24

I have tried running it like this to no avail.

clang++ -std=c++11 -O3 -ffast-math -mllvm -enable-constraint-elimination=0 \
        main.cpp -o out/main

Note, I had to chang @Signalsmith 's original makefile from g++ to clang++ to allow me to also test clang 19 as installed by brew.

This image should show the difference(s) between Apple Clang 15, 16 and a baseline Clang.

1