It seems the latest versions of Clang default to unrolling loops.
Even when -Os or -Oz is enabled (musl's default configuration)
This significantly increases code size on many functions.
I didn't do any bench-marking to determine how it affects speed.
It can be prevented by adding: -fno-unroll-loops

-R,
Brad Conroy