Lightnews — Scholar-powered news

Mirco Mutti

@mircomutti.bsky.social

1.4K followers 300 following 33 posts

Reinforcement learning, but without rewards.
Postdoc at the Technion. PhD from Politecnico di Milano.
https://muttimirco.github.io

Posts Replies Media Videos

Mirco Mutti

@mircomutti.bsky.social

Humans typically develop a standard strategy prescribing a sequence of tests to diagnose the condition before committing to the best treatment (see img left). Can we design a bandit algorithm that learns a similarly interpretable exploration but it's also provably efficient?

3/n

July 15, 2025 at 3:50 PM

Mirco Mutti

@mircomutti.bsky.social

If interested on our take on addressing inverse RL in large state spaces, go to meet @filippo_lazzati and @alberto_metelli in the poster session 5 #NeurIPS2024 today (paper -> arxiv.org/abs/2406.03812)

December 13, 2024 at 2:33 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news