Lightnews — Scholar-powered news


Abhishek Sharma

@abhishekshar.bsky.social

43 followers 200 following 7 posts

CS PhD @Harvard w/ Finale Doshi-Velez | Research in {Reinforcement Learning | Healthcare | Representation Learning}

🌐 https://abhishekshar.com/


Abhishek Sharma

@abhishekshar.bsky.social

Our paper: Decision-Point Guided Safe Policy Improvement
We show that a simple approach to learning safe RL policies can outperform most offline RL methods (+ theoretical guarantees!).

How? Only allow the state-action pairs that have been seen enough times! 🤯

arxiv.org/abs/2410.09361

Decision-Point Guided Safe Policy Improvement

Within batch reinforcement learning, safe policy improvement (SPI) seeks to ensure that the learnt policy performs at least as well as the behavior policy that generated the dataset. The core challeng...

January 23, 2025 at 6:23 PM
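
To make the "seen enough times" idea in the post above concrete, here is a minimal tabular sketch. It is not the paper's algorithm: the function name `count_thresholded_policy`, the threshold `n_min`, and the toy counts and Q-values are all hypothetical, and the rule shown (deviate from the behavior policy only at well-observed state-action pairs that look better than the behavior policy) is just one simple reading of the post.

```python
import numpy as np

def count_thresholded_policy(counts, q_values, behavior_policy, n_min):
    """Hypothetical helper: deviate from the behavior policy only at
    state-action pairs observed at least n_min times in the dataset."""
    n_states, _ = counts.shape
    policy = behavior_policy.copy()
    # Estimated value of following the behavior policy in each state.
    behavior_value = (behavior_policy * q_values).sum(axis=1)
    for s in range(n_states):
        # Actions with enough observations in state s.
        allowed = np.flatnonzero(counts[s] >= n_min)
        if allowed.size == 0:
            continue  # nothing seen often enough: keep the behavior policy
        best = allowed[np.argmax(q_values[s, allowed])]
        # Switch only if the well-observed action looks like an improvement.
        if q_values[s, best] > behavior_value[s]:
            policy[s] = 0.0
            policy[s, best] = 1.0
    return policy

# Toy data (assumed, for illustration only): 3 states, 2 actions.
counts = np.array([[50, 40], [1, 1], [30, 2]])
q_values = np.array([[1.0, 2.0], [0.0, 3.0], [0.5, 9.0]])
behavior_policy = np.full((3, 2), 0.5)

policy = count_thresholded_policy(counts, q_values, behavior_policy, n_min=10)
print(policy)
```

In this toy example, state 0 switches to action 1 (both actions are well observed), state 1 keeps the behavior policy (no action is seen often enough), and state 2 also keeps the behavior policy, because the action with the high estimated value was seen only twice and is therefore not allowed.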

Abhishek Sharma

@abhishekshar.bsky.social

Wow this is amazing! Thanks for sharing!

December 9, 2024 at 6:32 PM

Abhishek Sharma

@abhishekshar.bsky.social

The notes are great! Thank you!

November 22, 2024 at 3:23 PM

Abhishek Sharma

@abhishekshar.bsky.social

Would be cool to be included! (I work on RL in healthcare.)

November 22, 2024 at 4:30 AM