Lightnews — Scholar-powered news

larrytheliquid

@larrytheliquid.bsky.social

LLM's provide the automation of doing the translations between the languages, and formal verification checks their work.

March 19, 2025 at 3:06 PM

larrytheliquid

@larrytheliquid.bsky.social

The magic is formal proofs of semantics preservation when doing this translation, which is mediated by using Boogie as its intermediate representation language (amenable to formal verification via SMT solvers).

March 19, 2025 at 3:06 PM

larrytheliquid

@larrytheliquid.bsky.social

This MSR et. al. research is a great step in this direction. It shows how you can start with Python/Pandas as your unambiguous pseudocode intent language, and compile to inlined SparkSQL for performance.

Paper: dl.acm.org/doi/10.1145/...

QURE: AI-Assisted and Automatically Verified UDF Inlining | Proceedings of the ACM on Management of Data

User-defined functions (UDFs) extend the capabilities of SQL by improving code reusability and encapsulating complex logic, but can hinder the performance due to optimization and execution inefficienc...

dl.acm.org

March 19, 2025 at 3:06 PM

larrytheliquid

@larrytheliquid.bsky.social

1. Ambiguous but flexible natural language descriptions of intent
2. Unambiguous pseudocode to review and iterate on
3. Synthesis/compilation to your implementation language, formally verified to "refine" your pseudocode

QURE: AI-Assisted and Automatically Verified UDF Inlining | Proceedings of the ACM on Management of Data

User-defined functions (UDFs) extend the capabilities of SQL by improving code reusability and encapsulating complex logic, but can hinder the performance due to optimization and execution inefficienc...

dl.acm.org

March 19, 2025 at 3:06 PM

larrytheliquid

@larrytheliquid.bsky.social

thats cool! one other benefit (if youre into that kinda thing), is you get static artifacts in CI that are easy to feed into LLM's for debugging assistance too

March 5, 2025 at 10:55 PM

larrytheliquid

@larrytheliquid.bsky.social

nice, are there going to be any mendler-style encodings?

February 17, 2025 at 10:06 PM

larrytheliquid

@larrytheliquid.bsky.social

Interesting! Yeah I'm hoping that long term, LLM-powered auto-modeling/auto-formalization can finally make the fruits of modeling get absorbed by the world without needing to do the dirty work of planting the seeds etc :)

November 21, 2024 at 8:42 PM

larrytheliquid

@larrytheliquid.bsky.social

Thanks, originally it hurt to see ppl get excited about a part of a demo that i thought was just added value, compared to the parts that i was personally more invested in and were more difficult to make.

But once it happened enough times, it was too big to ignore from a business standpoint.

November 21, 2024 at 8:31 PM

larrytheliquid

@larrytheliquid.bsky.social

As our auto-modeling takes shape, we will expand from shallow bug fixing to deep bug fixing, using formal models as our technical leverage. Please join us by checking out the site, demo, or joining our discord :)

Site: colimit.io
Demo: www.loom.com/share/23aad3...
Discord: discord.gg/2jkP3m3Rhv 9/9

Colimit - Autofix Failed Builds

AI Assistance to help your GitHub Actions go from ❌ to ✅

colimit.io

November 21, 2024 at 6:49 PM

larrytheliquid

@larrytheliquid.bsky.social

Our current autofixer integrates with GitHub Actions and I'm excited to announce that we just launched it for Early Access!

Right now it's mostly useful for tedious but frequently occurring bugs, like linting failures or basic unit or integration test misaligned expectations. 8/9

November 21, 2024 at 6:49 PM

larrytheliquid

@larrytheliquid.bsky.social

The mission is the same, increasing software reliability by popularizing [semi]formal methods, but this prioritization allows it to grow as a business. 7/9

November 21, 2024 at 6:49 PM

larrytheliquid

@larrytheliquid.bsky.social

So we made a hard choice and reversed our roadmap:
1. First auto-fixing as as a service
2. Auto-modeling. Models act as "context compression" of a codebase, and can be checked for logical consistency w/o builds & running tests.
3. Reintroduce bug-finding, now palatable via auto-generated models. 6/9

November 21, 2024 at 6:49 PM

larrytheliquid

@larrytheliquid.bsky.social

Luckily, we noticed that during our demos one part of our product lit up people's eyes: The auto-generated "debugging notes", and the associated auto-fixer of the bugs that Colimit's bug-finder identified. With LLM's these days, this part is a lot easier to get off the ground. 5/9

November 21, 2024 at 6:49 PM

larrytheliquid

@larrytheliquid.bsky.social

For business uptake, any amount of extra work to be done quickly kills adoption/onboarding, even if the results are great. The answer was clear: Use LLM's to auto-model/auto-formalize: But that gives Colimit as a company even more work and less runway. 4/9

November 21, 2024 at 6:49 PM

larrytheliquid

@larrytheliquid.bsky.social

We found real bugs for our design partners, giving them room to refactor with confidence. At first we wrote the models for them, but when trying to hand off the modeling we learned a hard lesson. Even if modeling in our DSL is approx the same effort as writing tests, it's still something new. 3/9

November 21, 2024 at 6:49 PM

larrytheliquid

@larrytheliquid.bsky.social

The initial goal was to make model-checking and model-based testing (and even formalization) more widespread by creating an accessible TypeScript-like modeling language, paired with a cloud that executed verification and tests at scale. 2/9

November 21, 2024 at 6:49 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news