Alberto Torres Barrán
albertotb.com
Machine Learning at https://komorebi.ai/
Reposted by Alberto Torres Barrán
A step-by-step guide to building a key-value database from scratch: www.nan.fyi/database - love the explainer/interactive animation in this
November 14, 2025 at 4:27 PM
Reposted by Alberto Torres Barrán
crazy

IG em_clarkson
November 14, 2025 at 2:54 AM
Reposted by Alberto Torres Barrán
Why is it that all modern apps with “feeds”, like social media apps or even reading apps like Medium, Substack, etc., exhibit the anti-pattern of refreshing and losing your spot if you switch away for even a minute?
It’s absolutely infuriating.
October 28, 2025 at 11:54 AM
Reposted by Alberto Torres Barrán
💡 Came across this nice tool today:

🎨 qualpal for algorithmically choosing maximally distinct colors under certain restrictions #dataviz

JOSS paper, online tool, R package #rstats

joss.theoj.org/papers/10.21...
Qualpal: Qualitative Color Palettes for Everyone
Larsson, J., (2025). Qualpal: Qualitative Color Palettes for Everyone. Journal of Open Source Software, 10(114), 8936, https://doi.org/10.21105/joss.08936
joss.theoj.org
October 17, 2025 at 9:12 AM
Reposted by Alberto Torres Barrán
"I have a decent fluency in LLMs, and they have utility, but the absurd degree of over-hype, the way they're being forced on everyone, and the insistence on ignoring the many valid critiques about them make it very difficult to focus on legitimate uses where they might add value."
October 17, 2025 at 4:32 AM
Reposted by Alberto Torres Barrán
(1/3) Introduction to Multi-Stage Build 🐳👇🏼

The size of the image is a function of its dependencies and the efficiency of the image build. This tutorial focuses on resizing a #Python image using minimal images as our baseline and the multi-stage method.

medium.com/data-science...

#docker #mlops
Introduction to Multi-Stage Image Build for Python
This post introduces the Multi-Stage build approach for setting up a lightweight dockerized Python development environment.
medium.com
October 16, 2025 at 1:07 PM
Reposted by Alberto Torres Barrán
yes
October 13, 2025 at 12:14 PM
Reposted by Alberto Torres Barrán
I've turned this blog post cheatsheet into a downloadable cheatsheet.

You can get the cheatsheet from here: mathspp.com/blog/uv-chea...
October 10, 2025 at 8:34 AM
Reposted by Alberto Torres Barrán
@picanumeros.bsky.social is back to put the superscripts on the summation signs: against broad-brush statistics and bar-counter variance:

"The data is not being 'cooked' in a Machiavellian way to show a reality that does not exist"

www.sustrato.io/textos/homer...
Homer and public statistics | Ramón Ferri (Picanúmeros)
It needs to be said more often: public statistics are in the crosshairs, and that affects us more than we think.
www.sustrato.io
October 2, 2025 at 8:59 AM
Reposted by Alberto Torres Barrán
For those on LinkedIn who aren't aware: by default, all profiles now have 'Data for Generative AI Improvement' turned on.

You can turn it off via: Settings > Data Privacy > Data for Generative AI Improvement > Select the OFF option.

Do not feed the planet-destroying machine!✊
September 19, 2025 at 7:06 AM
Reposted by Alberto Torres Barrán
I love this analytical take on video game categorization from Antoine Mayerowitz and Julie Belzanne:

hushcrasher.substack.com/p/taxonomy-o...

Instead of trying to vibe-intuit the definition of an "indie" game, the authors analyzed the data from the perspective of game size and credits length
September 14, 2025 at 10:50 AM
Reposted by Alberto Torres Barrán
Our Python doc is officially out in the wild! 🐍

Thanks to everyone who joined the premiere 🙌 such a good vibe.

Here’s the link so you can watch it on repeat youtu.be/GfH4QL4VqJ0
Python: The Documentary | An origin story
YouTube video by CultRepo (formerly Honeypot)
youtu.be
August 29, 2025 at 12:00 AM
Reposted by Alberto Torres Barrán
A research team from Tsinghua University, Stanford University, and the Max Planck Institute for Informatics has developed the first deterministic algorithm since 1984 that improves on the long-standing O(m + n log n) bound for finding the shortest paths from a single starting point to all
August 13, 2025 at 1:47 PM
Very cool! skore is criminally underrated

youtu.be/mmcRcIY13GE?...
Introducing the data accessor of the EstimatorReport
YouTube video by probabl
youtu.be
August 12, 2025 at 11:57 AM
Reposted by Alberto Torres Barrán
Spent some time refining the approach and benchmarking it. Just published a blog post about the core idea.

Can a bunch of LLM Agents be used to rank an arbitrary set of items in a consistent way? 🤖

davidgasquez.com/ranking-with...
August 8, 2025 at 1:38 PM
Reposted by Alberto Torres Barrán
Good article. Instead of trying to build products promising to replace humans with "superintelligence", consider the *actual* shape of language models and their capabilities. We have a great floor-raiser here that can be applied to bridge capability gaps.

elroy.bot/blog/2025/07...
AI is a Floor Raiser, not a Ceiling Raiser - Elroy
An AI assistant that remembers and sets goals
elroy.bot
August 4, 2025 at 2:16 AM
Reposted by Alberto Torres Barrán
This post tries to explain why I find language models exciting. But it doesn't try to persuade skeptics that they should agree. It's aimed more at people already working with AI, and its goal is to sharpen our collective sense of what the upside potential might be. #MLSky 🤖 🧪
A more interesting upside of AI
Does AI provide anything to look forward to, if “super-intelligence” sounds boring?
tedunderwood.com
July 2, 2025 at 2:49 PM
Reposted by Alberto Torres Barrán
This is a nice and clear "overview of the state of RAG"

hamel.dev/notes/llm/ra...

(via @arnicas.bsky.social's wonderful newsletter)
P1: I don’t use RAG, I just retrieve documents – Hamel’s Blog
Ben Clavié’s introduction to advanced retrieval techniques
hamel.dev
July 3, 2025 at 6:17 AM
Reposted by Alberto Torres Barrán
ICYMI githistory.xyz is a great way to navigate #git commits and visualise how a file has been changing across commits

Just replace github.com with github.githistory.xyz in the URL and enjoy! #rstats
June 19, 2025 at 5:34 PM
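The URL swap described in the post above can be sketched in a few lines of Python. This is only an illustration of the host replacement, not part of the original post; the function name `to_githistory` and the example repository path are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def to_githistory(url: str) -> str:
    """Swap the github.com host for github.githistory.xyz, keeping the rest of the URL.

    `to_githistory` is a hypothetical helper name used for illustration.
    """
    parts = urlsplit(url)
    if parts.netloc.lower() != "github.com":
        raise ValueError(f"not a github.com URL: {url}")
    # SplitResult is a namedtuple, so _replace returns a copy with the new host.
    return urlunsplit(parts._replace(netloc="github.githistory.xyz"))


# Hypothetical file URL, used only to show the transformation.
print(to_githistory("https://github.com/user/repo/blob/main/README.md"))
```

Parsing the URL instead of doing a plain string replace means only the host changes, so a path that happens to contain "github.com" is left alone.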
Reposted by Alberto Torres Barrán
👀 This week's post is a sneak peek into the next major Skrub feature, Skrub expressions 🚀

As this is a preview of an upcoming feature, we are looking for your thoughts and feedback before release.
April 30, 2025 at 10:00 AM
Reposted by Alberto Torres Barrán
You’ve probably heard about how AI/LLMs can solve Math Olympiad problems ( deepmind.google/discover/blo... ).

So naturally, some people put it to the test — hours after the 2025 US Math Olympiad problems were released.

The result: They all sucked!
March 31, 2025 at 8:33 PM