elucent
@elucent.website
compiler creature
mischievous finding from a conversation in plcord today: zig's fancy new compiler backend - the entire point of which is to compile faster than the existing LLVM one - takes more than *twice as long* as clang -O0 in total user time 🙃

benchmark is 10k quicksort functions

clover ofc trashes both 😎
October 5, 2025 at 2:40 AM
an underground explorer and her vehicle come face to face with a ne'er do well
July 29, 2025 at 6:14 AM
was compelled to draw tenna
July 15, 2025 at 5:11 AM
it's so funny to me how intense the scala community is. imagine making a reality show about clang libc++ maintainers and buying twitter ads for it, and that the reason you would do this is to fight your personal culture war against the gcc libstdc++ maintainers. that's how scala is in real life
June 30, 2025 at 3:32 AM
and now clover has full support for array/slice patterns too! these even support multiple splat parameters (i.e. rest..., body...) per pattern, which i used to implement this cute BF interpreter :)
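Clover's actual syntax and algorithm aside, the multi-splat idea can be sketched in Python. Python's own sequence patterns allow only one star, so this hypothetical `match_seq` helper backtracks to bind several splats per pattern:

```python
def match_seq(pat, seq, bindings=None):
    """Match a sequence against a pattern that may contain several splats.

    Pattern elements are literals or splat names ending in '...'.
    Splats bind shortest-first via backtracking. This is a toy
    illustration, not Clover's actual matching algorithm.
    """
    if bindings is None:
        bindings = {}
    if not pat:
        return bindings if not seq else None
    head, rest = pat[0], pat[1:]
    if isinstance(head, str) and head.endswith('...'):
        name = head[:-3]
        # Try every possible split point, shortest binding first.
        for i in range(len(seq) + 1):
            trial = dict(bindings)
            trial[name] = list(seq[:i])
            result = match_seq(rest, seq[i:], trial)
            if result is not None:
                return result
        return None
    if seq and seq[0] == head:
        return match_seq(rest, seq[1:], bindings)
    return None
```

For example, matching the pattern `('[', 'rest...', ']', 'body...')` against `list('[ab]cd')` binds `rest` to `['a', 'b']` and `body` to `['c', 'd']`. (A real BF interpreter would also need to pair nested brackets, which shortest-first binding alone doesn't handle.)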
June 15, 2025 at 8:55 AM
added codegen support for swizzling and tuple patterns
June 15, 2025 at 6:55 AM
whew. had to make a surprising number of backend changes for this, but Clover has sum types and pattern matching now :)
June 7, 2025 at 11:52 PM
post wpm
May 31, 2025 at 7:15 PM
wait a fucking minute, V DCEs every function even if they're public lmao??? i have to compile a shared library to get them to stick, in which case it takes about 50% longer. the TinyCC component would still beat everything else, but not by as huge of margins; it's about the same as codegen in clover
May 31, 2025 at 6:55 AM
V doing as badly as it did was also pretty surprising since my understanding is TinyCC is actually very well optimized. a quick look at its flamegraph though shows almost all the time is in the V compiler itself, in parsing of all things lol. if i tested TinyCC on its own it would beat clover by 10x
May 31, 2025 at 6:44 AM
here's the memory usage over time of each process too. clang apparently also has the lowest overall memory usage?? clover is pretty close, V is off by a factor of 2, hare is in the stratosphere (this is probably why it takes so much system time)
May 31, 2025 at 6:44 AM
one last indulgent perf comparison for today: tried this same benchmark (10k quicksort functions) in a couple other systems languages. clover wins by a lot, but honestly clang did a lot better than i thought it would
May 31, 2025 at 6:44 AM
aaaaaand this was in fact pretty easy to optimize, it's now about 5x faster >:)
May 31, 2025 at 4:09 AM
did a pretty simple change and nuked the entire SCC resolution part of this lol. the compiler is now 30% faster
May 31, 2025 at 3:50 AM
finally, the biggest shock: 24.25% of the compile time is just formatting the ELF object! and of that, 16.93% is just putting symbol definitions into a hash table! i haven't tried to optimize this at all so it's not surprising, probably a lot can be done to improve here

[10/12]
May 31, 2025 at 3:18 AM
next up is IR lowering. i collected these numbers without opts, so the bulk of the time is spent simply lowering IR instructions (8.09%) and generating assembly (6.10%). i suspect doing basic register allocation would actually improve this by reducing the instruction count

[9/12]
May 31, 2025 at 3:18 AM
next step after typechecking (for now) is IR generation, which takes up 7.88% of total runtime. about 0.72% comes from the initial allocation of each IR function, with defining IR variables and generating conditional branches seemingly taking up a fair amount of time throughout the tree

[8/12]
May 31, 2025 at 3:18 AM
now for the big stuff: 33.29% of runtime in type inference. 10.83% of this is in the initial discovery pass, 3.64% in the later refinement pass, and a shocking 14.78% just to resolve SCCs in the constraint graph! i bet this can be optimized, my data structures aren't tailored for large input
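for reference, SCC resolution here means grouping mutually dependent constraints into strongly connected components. a textbook recursive Tarjan sketch in Python (hypothetical names, not the compiler's actual code or data structures):

```python
def tarjan_scc(graph):
    """Return strongly connected components of a {node: [successors]} graph.

    Classic Tarjan: one DFS, each node gets an index and a low-link;
    a node whose low-link equals its own index is the root of an SCC.
    """
    index, low = {}, {}
    stack, on_stack = [], set()
    sccs = []
    counter = 0

    def dfs(v):
        nonlocal counter
        index[v] = low[v] = counter
        counter += 1
        stack.append(v)
        on_stack.add(v)
        for w in graph.get(v, ()):
            if w not in index:
                dfs(w)
                low[v] = min(low[v], low[w])
            elif w in on_stack:
                low[v] = min(low[v], index[w])
        if low[v] == index[v]:
            # v roots an SCC: pop the stack down to and including v.
            component = []
            while True:
                w = stack.pop()
                on_stack.discard(w)
                component.append(w)
                if w == v:
                    break
            sccs.append(component)

    for v in list(graph):
        if v not in index:
            dfs(v)
    return sccs
```

the algorithm itself is linear in nodes plus edges, so on large inputs the cost tends to come from the container churn around it, which lines up with the "data structures aren't tailored for large input" suspicion.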

[6/12]
May 31, 2025 at 3:18 AM
next we have scope resolution, which takes up 3.02% of total runtime. i'm a little surprised we spend about 1.82% of this just traversing the tree! we do another tree traversal to late-resolve access chains, which accounts for another 0.44%, and i left validation on, which is another 0.55%

[5/12]
May 31, 2025 at 3:18 AM
next we create the scope tree, which takes 1.16% of total compilation. interestingly, a lot of the time in this function is spent just handling page faults. i think this is actually a sign the pass is so cheap that most of the cost is just paging in memory to store the results

[4/12]
May 31, 2025 at 3:18 AM
next is parsing, which occupies 2.08% of total runtime. again i think nothing too surprising, we mostly parse statements, and the call stacks just follow the recursive structure of the source program

[3/12]
May 31, 2025 at 3:18 AM
first is lexing, which takes up 2.97% of total compiler runtime. honestly not too much to see here! we spend most of our time (1.24%) lexing identifiers, 0.22% lexing numbers, 0.64% allocating storage for the token vector, and 0.15% handling indents and dedents
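the indent/dedent handling is the usual offside-rule trick: keep a stack of open indentation widths and emit synthetic tokens when the width changes. a rough Python sketch (token names invented, not Clover's lexer):

```python
def lex_indentation(lines):
    """Turn leading-space widths into INDENT/DEDENT tokens, offside-rule style."""
    stack = [0]        # indentation levels currently open
    tokens = []
    for line in lines:
        if not line.strip():
            continue   # blank lines neither open nor close blocks
        width = len(line) - len(line.lstrip(' '))
        if width > stack[-1]:
            # Deeper than the enclosing block: open a new level.
            stack.append(width)
            tokens.append(('INDENT', width))
        while width < stack[-1]:
            # Shallower: close levels until we're back to a known width.
            stack.pop()
            tokens.append(('DEDENT', stack[-1]))
        tokens.append(('LINE', line.strip()))
    return tokens
```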

[2/12]
May 31, 2025 at 3:18 AM
took some time to fix some bugs and stress test the compiler, with the goal of measuring how good the new type inference is for compile times!

short answer, pretty darn good! figured it might be fun to write a thread breaking down this flamegraph by compiler pass :)

[1/12]
May 31, 2025 at 3:18 AM
type constraint graph for a growable vector of 32-bit integers

mostly i just get a kick out of making graphs
May 27, 2025 at 5:28 AM
new type inference algorithm has reached parity. now i get to break it again :)
May 27, 2025 at 5:09 AM