Nisan Haramati
banner
nisanharamati.bsky.social
Nisan Haramati
@nisanharamati.bsky.social
Data Systems for Infinite Scale, Math, Physics, Croissants. Founder.
Reposted by Nisan Haramati
This post breaks down why understanding precision and recall is essential when building search and information retrieval systems for high stakes decision making:

www.graphiumlabs.com/blog/precisi...
100% Recall and 100% Precision in Modern Search — Graphium Labs-
Precision and recall aren’t just technical jargon—they make up the difference between trust and risk in modern search systems. And they matter more than most people think.
www.graphiumlabs.com
July 24, 2025 at 6:44 PM
Reposted by Nisan Haramati
In high-stakes environments, like medical diagnostics, legal research, and threat detection, the trade off between high recall and high precision isn’t just a theoretical optimization problem. The choice has real-world consequences.
July 24, 2025 at 6:44 PM
Reposted by Nisan Haramati
Ideally, users want both at 100% – all (good) signal, and zero noise. But the way search works under the hood often forces a trade off: higher recall requires looser filters to bring in more results, and consequentially, more irrelevant results or noise, which bring down precision.
July 24, 2025 at 6:44 PM
Reposted by Nisan Haramati
Precision means: “Of the results that were returned, how many were relevant (correct)?”

And recall says: “Of all the correct results, how many were returned?”
July 24, 2025 at 6:44 PM
Reposted by Nisan Haramati
In search and information retrieval systems, precision and recall are more than just evaluation metrics—they reflect how well a system aligns with the user’s needs and expectations of relevance and completeness.
July 24, 2025 at 6:44 PM
So when nuance is important, semantic search built on vector similarity tends to miss the mark by a really, really wide margin.
July 1, 2025 at 6:37 PM
I'll start: vector embeddings don't encode semantics, they encode substitutability. It _looks right_ if you squint at it, or if the use case is pretty trivial (e.g. "brown" vs. "chocolate" when describing a sofa).

But opposites also have high substituability (good/bad, dark/light, rich/poor, etc.)
July 1, 2025 at 6:37 PM
I once failed the "check the checkbox" test by checking it... Wrong? I guess?
April 30, 2025 at 4:28 PM
Hey Tim let's talk.
March 18, 2025 at 3:15 AM