Lightnews — Scholar-powered news

Reposted by Antonin Raffin

EWRL18

@ewrl18.bsky.social

Exciting workshop for RL enthusiasts in Mannheim! 👇

Workshop on Reinforcement Learning 2026, taking place on 𝐅𝐞𝐛𝐫𝐮𝐚𝐫𝐲 𝟔, 𝟐𝟎𝟐𝟔, at the 𝐔𝐧𝐢𝐯𝐞𝐫𝐬𝐢𝐭𝐲 𝐨𝐟 𝐌𝐚𝐧𝐧𝐡𝐞𝐢𝐦, Germany.
Participation in the workshop is 𝐟𝐫𝐞𝐞 𝐨𝐟 𝐜𝐡𝐚𝐫𝐠𝐞!
Check the program and register: www.wim.uni-mannheim.de/doering/conf...

November 25, 2025 at 1:51 PM

Reposted by Antonin Raffin

Gautam Kamath

@gautamkamath.com

TMLR (@tmlrorg.bsky.social) is now proud to support interactive HTML-based submissions, going "Beyond PDF" -- check it out!

Thanks to Paul Vicol (@paulvicol.bsky.social) for his tireless work on this new option, as well as the OpenReview team.

Paul Vicol @paulvicol.bsky.social · 8h

🚀 Introducing TMLR Beyond PDF!

🎬 This is a new, HTML-based submission format for TMLR, that supports interactive figures and videos, along with the usual LaTeX and images.

🎉 Thanks to TMLR Editors in Chief: Hugo Larochelle, @gautamkamath.com, Naila Murray, Nihar B. Shah, and Laurent Charlin!

November 25, 2025 at 4:14 PM

Reposted by Antonin Raffin

Christian Hubicki

@chubicki.bsky.social

Wow. The backlash to the 1X Neo announcement has been widespread and *merciless*.

This may be a warning to lots of humanoids companies. All your promises don’t matter to the public if your robot looks or acts dumb.

youtu.be/b_SNExtznd4?...

Ronny Chieng Meets Neo, the World’s Stupidest Robot Maid | The Daily Show

YouTube video by The Daily Show

youtu.be

October 31, 2025 at 12:34 PM

Reposted by Antonin Raffin

TheGoodParts.dev

@thegoodparts.dev

Why self-taught engineers often outperform

michaelbastos.com/blog/why-sel...

#programming #softwaredevelopment #tech #blog

michaelbastos.com

October 29, 2025 at 7:34 PM

Reposted by Antonin Raffin

Pablo Samuel Castro

@pcastr.bsky.social

🚨The Formalism-Implementation Gap in RL research🚨

Lots of progress in RL research over last 10 years, but too much performance-driven => overfitting to benchmarks (like the ALE).

1⃣ Let's advance science of RL
2⃣ Let's be explicit about how benchmarks map to formalism

1/X

October 28, 2025 at 1:56 PM

Reposted by Antonin Raffin

prefix.dev

@prefix.dev

🚨 New blog post alert!

Modern package management for Robotics with Pixi!

prefix.dev/blog/reprod...

#ROS #ROSCon #ROSCon2025

Pixi: Modern package management for Robotics

Developing Robots is hard; Pixi makes it easier by creating reproducible, cross-platform ROS development environments without Docker or Ubuntu lock-in.

prefix.dev

October 24, 2025 at 3:34 PM

Reposted by Antonin Raffin

Eugene Vinitsky 🍒

@eugenevinitsky.bsky.social

What if we did a single run and declared victory

Three panel thing. In the left panel we use error bars. In the second, we take statistical significance as the biggest number but still have error bars. In LLM science, we just have the biggest number

October 23, 2025 at 2:28 AM

Antonin Raffin

@araffin.bsky.social

A wonderful collection of spurious correlations, correlation is not causation.

link: www.tylervigen.com/spurious-cor...

found via @stefanjudis.com newsletter

October 21, 2025 at 5:44 AM

Antonin Raffin

@araffin.bsky.social

A good video on software refactoring and redesign (about the Audacity audio editing program)

Tantacrul @tantacrul.bsky.social · Oct 3

New video is OUT! - How We're Building Audacity 4

youtu.be/QYM3TWf_G38?...

October 20, 2025 at 10:20 AM

Reposted by Antonin Raffin

Michiel van de Panne

@mvandepanne.bsky.social

Video recordings of CORL 2025 talks now available! Many interesting orals / keynotes / sponsor talks / early-career talks / poster spotlights.
Day 1: www.youtube.com/watch?v=Use5...
Day 2: www.youtube.com/watch?v=rh2o...
Day 3: www.youtube.com/watch?v=9lzF...

CORL 2025

YouTube video by Conference on Robot Learning

www.youtube.com

October 17, 2025 at 5:31 AM

Reposted by Antonin Raffin

prefix.dev

@prefix.dev

In our little deep dive series we're now exploring how cross-compilation in the Conda ecosystem works: prefix.dev/blog/cross-c.... Back in the days, @conda-forge.org rolled this out widely to support osx-arm64 early on, and now for linux-aarch64/ppc64le.

Cross compiling in the Conda ecosystem

Cross compiling is a fundamental capability in modern software development, allowing developers to build packages for different architectures without needing access to the target hardware.

prefix.dev

October 15, 2025 at 6:11 AM

Reposted by Antonin Raffin

Daphne Cornelisse

@daphne-cornelisse.bsky.social

Rapid RL experimentation is great. But how do you catch silent errors before they slip by?

In this post, I share tools and habits that help me move quickly from idea to result without sacrificing reliability.

How to catch subtle RL bugs before they catch you

Tools and habits for reliable, fast RL experimentation and development

open.substack.com

October 13, 2025 at 11:29 AM

Reposted by Antonin Raffin

Grant Sanderson

@3blue1brown.com

Ever since I made a video about Fourier Transforms, one of the most requested topics on the channel has been its close cousin, the Laplace Transform.

I've been having a lot of fun animating a mini-series about this topic, and the main part is now out.

youtu.be/j0wJBEZdwLs

But what is a Laplace Transform?

YouTube video by 3Blue1Brown

youtu.be

October 12, 2025 at 12:49 PM

Reposted by Antonin Raffin

Sebastian Raschka (rasbt)

@sebastianraschka.com

Updated & turned my Big LLM Architecture Comparison article into a video lecture.

The 11 LLM archs covered in this video:
1. DeepSeek V3/R1
2. OLMo 2
3. Gemma 3
4. Mistral Small 3.1
5. Llama 4
6. Qwen3
7. SmolLM3
8. Kimi 2
9. GPT-OSS
10. Grok 2.5
11. GLM-4.5/4.6

www.youtube.com/watch?v=rNlU...

The Big LLM Architecture Comparison

YouTube video by Sebastian Raschka

www.youtube.com

October 10, 2025 at 5:05 PM

Antonin Raffin

@araffin.bsky.social

Mjlab

Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research.

github.com/mujocolab/mj...

GitHub - mujocolab/mjlab: Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research.

Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research. - mujocolab/mjlab

github.com

October 10, 2025 at 9:35 AM

Antonin Raffin

@araffin.bsky.social

SBX (SB3 Jax) v0.23.0 is out =)!

I added CNN support for PPO.
It turns out that using a shared features extractor (CNN in this case) is important for achieving good performance on Atari games.

Perf report: wandb.ai/openrlbenchm...

github.com/araffin/sbx

GitHub - araffin/sbx: SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms

SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms - araffin/sbx

github.com

September 29, 2025 at 5:23 PM

Antonin Raffin

@araffin.bsky.social

Training a small humanoid robot with reinforcement learning using another robot for reset.

by Kaizhe Hu et al. (ToddlerBot Stanford)

Project page: robot-trains-robot.github.io

a robot arm support a robot humanoid on a treadmill

September 29, 2025 at 8:48 AM

Antonin Raffin

@araffin.bsky.social

Open-Source Hardware in the Era of Robot Learning Workshop @ CoRL 2025

Website: open-hardware-robots.github.io/CoRL2025/

September 27, 2025 at 6:19 AM

Reposted by Antonin Raffin

Stéphane Caron

@locoscaron.fosstodon.org.ap.brid.gy

The CoRL 2025 workshop on Open-Source Hardware in the Era of Robot Learning is starting now! You can join the conversation online via live streaming: https://www.youtube.com/live/ZVPIJzF1df4

September 27, 2025 at 12:32 AM

Reposted by Antonin Raffin

sophie-xhonneux.bsky.social

@sophie-xhonneux.bsky.social

📣 Call for Blog Posts at #ICLR2026 @iclr_conf

Following the success of the past iterations, we are opening the Call for Blog Posts 2026!

iclr-blogposts.github.io/2026/about/#...

Please retweet!

abs-0.twimg.com

September 22, 2025 at 7:44 AM

Antonin Raffin

@araffin.bsky.social

A practical introduction to (deep) RL, providing intuitions to understand the more recent algorithms.

The plan is to start from tabular Q-learning and work our way up to Deep Q-learning (DQN). In a following post, I will continue on to Soft Actor-Critic (SAC) and its extensions.

Antonin Raffin @araffin.bsky.social · Sep 18

RL102: From Tabular Q-Learning to Deep Q-Learning (DQN) - A Practical Introduction to (Deep) Reinforcement Learning

araffin.github.io/post/rl102/

RL102: From Tabular Q-Learning to Deep Q-Learning (DQN) | Antonin Raffin | Homepage

This blog post is meant to be a practical introduction to (deep) reinforcement learning1, presenting the main concepts and providing intuitions to understand the more recent Deep RL algorithms. For a ...

araffin.github.io

September 22, 2025 at 8:06 AM

Reposted by Antonin Raffin

Stéphane Caron

@locoscaron.fosstodon.org.ap.brid.gy

Next Saturday, 𝗔𝗻𝘁𝗼𝗶𝗻𝗲 𝗣𝗶𝗿𝗿𝗼𝗻𝗲 will present Pollen Robotics & Hugging Face's open-source robots, including Reachy Mini, the SO-100 arm, the Amazing Hand and the Open Duck Mini. He will discuss the sim2real challenges of making the Open Duck Mini walk, and how […]

[Original post on fosstodon.org]

The Open Duck Mini open-source and open-hardware robot.

September 21, 2025 at 12:23 PM

Antonin Raffin

@araffin.bsky.social

RL102: From Tabular Q-Learning to Deep Q-Learning (DQN) - A Practical Introduction to (Deep) Reinforcement Learning

araffin.github.io/post/rl102/

RL102: From Tabular Q-Learning to Deep Q-Learning (DQN) | Antonin Raffin | Homepage

This blog post is meant to be a practical introduction to (deep) reinforcement learning1, presenting the main concepts and providing intuitions to understand the more recent Deep RL algorithms. For a ...

araffin.github.io

September 18, 2025 at 3:09 PM

Reposted by Antonin Raffin

prefix.dev

@prefix.dev

Package building with Pixi is being rolled out! Dive into our latest blog post on crafting C++ packages.

And guess what? It’s not just for C++; Pixi plays nice with Python, Rust, ROS, Mojo, and beyond!

prefix.dev/blog/pixi-b...