Joe Barrow
@jbarrow.bsky.social
100 followers
190 following
22 posts
NLP @ Pattern Data
Prev: Adobe Research, PhD UMD
Posts
Media
Videos
Starter Packs
Joe Barrow
@jbarrow.bsky.social
· May 31
Joe Barrow
@jbarrow.bsky.social
· Apr 23
Horseshoes (and Hand Grenades) - LLM Localization is not Close, but not Close Enough - Joe Barrow
TL;DRLarge Multimodal Models (LMMs) can now output bounding boxes when given images as inputs. The results are impressive, but for documents they aren't good enough for real world use, yet. The Probl…
notes.penpusher.app
Joe Barrow
@jbarrow.bsky.social
· Mar 11
Joe Barrow
@jbarrow.bsky.social
· Jan 7
Joe Barrow
@jbarrow.bsky.social
· Jan 7
Joe Barrow
@jbarrow.bsky.social
· Dec 21
Joe Barrow
@jbarrow.bsky.social
· Dec 21
Joe Barrow
@jbarrow.bsky.social
· Dec 20
Google Gemini 101 - Object Detection with Vision and Structured Outputs - Joe Barrow - Obsidian Publish
This is a missing manual for how to get a simple working prototype up and running with Gemini's vision mode and structured outputs. I'm confident that manual exists elsewhere, but I haven't been able…
publish.obsidian.md
Reposted by Joe Barrow
Joe Barrow
@jbarrow.bsky.social
· Dec 18
Joe Barrow
@jbarrow.bsky.social
· Dec 9
Joe Barrow
@jbarrow.bsky.social
· Nov 25