It seems the latest versions of Clang unroll loops by default,
even when -Os or -Oz is enabled (musl's default configuration).
This significantly increases code size in many functions.
I didn't do any benchmarking to determine how it affects speed.
It can be prevented by adding: -fno-unroll-loops

-R,
Brad Conroy
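
P.S. If turning unrolling off globally is too blunt, Clang also accepts a per-loop hint. The following is only a minimal sketch: sum_bytes is a made-up helper for illustration, not a musl function, and I have not measured what the hint does to size or speed.

```c
#include <stddef.h>

/* Hypothetical helper for illustration only (not part of musl):
 * adds up the n bytes starting at p. */
unsigned sum_bytes(const unsigned char *p, size_t n)
{
	unsigned s = 0;
	size_t i;

	/* Clang-specific loop hint: ask the optimizer not to unroll the
	 * loop that follows. Guarded so other compilers never see it. */
#if defined(__clang__)
#pragma clang loop unroll(disable)
#endif
	for (i = 0; i < n; i++)
		s += p[i];

	return s;
}
```

The hint only applies to the loop directly after it, so the rest of the file is still optimized as usual.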