Skrub
banner
skrub-data.bsky.social
Skrub
@skrub-data.bsky.social
skrub is a Python library to ease preprocessing and feature engineering for tabular machine learning.
Our long-term goal is to directly connect database tables to machine learning estimators.

https://skrub-data.org
https://discord.gg/ABaPnm7fDC
skrub DataOps help you construct complex and extensive hyperparameter search spaces. However, interpreting results from large grids can be challenging.
To address this, skrub generates a parallel coordinate plot that visualizes all runs and the parameters used to achieve specific results.
September 12, 2025 at 12:56 PM
The Skrub TableReport is a lightweight tool that allows to get a rich overview of a table quickly and easily.

✅ Filter columns
🔎 Look at each column's distribution
📊 Get a high level view of the distributions through stats and plots, including correlated columns
🌐 Export the report as html
April 23, 2025 at 11:49 AM
👀 Explore your dataframes interactively with TableReport.

AKA: 📈 we heard you liked plots so we put plots in your tables 📈
November 19, 2024 at 3:33 PM