Phil Steitz
banner
psteitz.bsky.social
Phil Steitz
@psteitz.bsky.social
Animal lover, trail runner, mathematician, product and tech leader, open source developer
Yeah the “receiving” app is Java. Kind of comical that the Java code generated by the python generator is better than most of the Java code I have been able to get Claude to write :).
October 14, 2025 at 4:51 AM
Cf. OpenAI “Why LMs Hallucinate.” 70%+ calibration error rates on code generation tasks. Any distributional “intuition” is bound to fail. If we could hone intuition to predict errors, so could the LLMs. If the code matters, you need to look at it carefully.
October 11, 2025 at 8:00 PM