Max Caldwell
growth.wtf
@growth.wtf
Bootstrapping a software company. In my free time I try to do things with data and statistics, or post random thoughts here.
0/3 + "you need help"
January 23, 2025 at 3:47 AM
As expected
January 23, 2025 at 3:46 AM
"reasoning"
January 23, 2025 at 3:45 AM
????
January 23, 2025 at 3:44 AM
It's crazy to me just how consistently wrong the google search llm is.

Is this like, a quantized gemini 1 that can't even see the search results it is summarizing? That's the only explanation for why it doesn't seem to get context even from featured snippets that they should have cached already.
January 1, 2025 at 11:17 PM
:/
December 12, 2024 at 4:10 AM
Sure, here's an example:
huggingface.co/datasets/Ple...

I am no expert but it seems legit?
December 6, 2024 at 12:11 AM
Something I've always been curious about with social networks is how much different actions lead to an audience. At scale, it would be interesting to learn how large accounts got large, but I'm also curious what gets people to follow. Is seeing a good post more influential? Or follow-backs? Well...
December 5, 2024 at 8:49 PM
From 2022 - tsunami danger zones in the bay area per the mercury news: www.mercurynews.com/2022/10/12/m...
December 5, 2024 at 7:22 PM
@alpindale.bsky.social Getting there, not quite getting the model injection code right yet.

I have to start my actual work now, but when or if I finish this I'll open source it.
December 5, 2024 at 3:59 PM
OK, got a little DND game running with github.com/ag2ai/ag2 -- my review is that it's a neat set of concepts but very difficult to use and quite buggy, and I was hard-pressed to think of an actual use case that would fit into this.
December 5, 2024 at 8:09 AM
Hey @alpindale.bsky.social, I saw you're working on the ruliad project? Can you help explain how to get the model to follow scripted directions? I'm trying to write an ollama template to try to get it to use tools, or follow scripted tasks of any order.
December 5, 2024 at 6:50 AM
deepthoughts-8b-tools is confused, but it's got the spirit?
December 5, 2024 at 6:17 AM
Unfortunately it's just not trained for this. Even if you get it working with Ollama tool calling, the model provides a total hallucination about half the time, and it is unable to provide responses in the standard tool calling format.
December 5, 2024 at 5:47 AM
Can I fix it?
December 5, 2024 at 4:46 AM
Tell me you're overthinking it without telling me you're overthinking it. Full reasoning chain in the alt text.
December 5, 2024 at 4:21 AM
it's happening
December 5, 2024 at 4:17 AM
20% worse than o1 and claude-latest at reasoning evals, but better than Qwen-72b, which is interesting?
December 5, 2024 at 3:03 AM
Update, tailwind4 totally kicks ass. This is all you need to create new utility classes usable in code... normal css and variables!
December 4, 2024 at 2:22 AM
skill issue
December 3, 2024 at 4:32 AM
My cat has gotten in the habit of coming to yell at me until I let him sit on my chest around this time. Funny way to be forced to take a work break. Not allowed to say no to the cat.
December 3, 2024 at 2:27 AM
Do you find 300 characters limiting? Would you like to express yourself in way more than a single post can contain? Use the alt text!

It can contain almost 6x as many characters before you hit the size limits.

Click the alt text to see the first 2,000 characters of the Canterbury Tales.
November 28, 2024 at 12:28 AM
November 28, 2024 at 12:08 AM
He blocked me after this :-(
November 27, 2024 at 7:02 AM
Explaining the watershed algorithm
November 23, 2024 at 8:23 PM