James Futhey
banner
jamesfuthey.com
James Futhey
@jamesfuthey.com
🌈 Indie Hacker, Founder @Meetingroom365.com - Seattle / Taipei - jamesfuthey.com - Previously Analytics @Adobe, Design @HBO. @kidgdzilla on the legacy app. Building 🌟 transparent.se 🕹️ pmn.blue 🍍 indie.am/james
"Pornographic Images with Rivers"

This is what happens when you let an LLM label clusters based on alt tags, 0.001% of the time 🧐

www.transparent.se/image-cluste...

The alt tag is almost always "IEMBot Image TBD" so.... 🤡
May 31, 2025 at 2:54 PM
Clustering just fell into place. Only issues I had were pipeline stuff, none of the core assumptions were too far off, MiniBatchKMeans just... worked!

Took what I learned from autolabeling posts and applied that to alt tags. 4o-mini handled those.
May 4, 2025 at 2:18 PM
Everything went better than expected, basically successful on the first try.

MobileCLIP is WAY better than it deserves to be, given how performant it is.

Never got Zero shot out of it but the embeddings are very good.
May 4, 2025 at 2:18 PM
Every image posted to Bluesky in the last week, semantically clustered and labeled.

Jetstream -> Postgres -> Python (ML)

www.transparent.se/image-cluste...
May 4, 2025 at 2:18 PM
ahhhhhh I can't wait for image cluster labeling to finish, this is going to be such a cool little thing to share tomorrow.

I think it'll take 3 hours, I never optimized the script to do any of this in parallel, I just go to dinner and come back and it's done.
May 4, 2025 at 9:30 AM
Clustering pipeline on 6.6m Bluesky posts currently takes 1 hour 44 minutes on my Macbook Pro.

Good news is that it now actually works without any errors or bugs, so I can probably productionalize it and run it on a GPU next week instead.
May 4, 2025 at 5:56 AM
Pikmin update, 81 days to Seattle…

I don’t know why this is so funny

I think I get a sticker when he gets back 🤷
May 3, 2025 at 4:26 PM
I wonder if bro’s gonna make it back before I do…
May 2, 2025 at 2:52 PM
Threw up a Cluster Explorer for 🦋 posts
www.transparent.se/clusters.html

Not enough (40k) posts to say the clusters have stabilized yet, but you can view the centroids in x/y space, search, view random posts, etc.

It's kind of fun!

But, uhh, my embedding pipeline sucks. So it's only 40k posts 😅
April 22, 2025 at 4:39 PM