Sikata Sengupta
@sikatasengupta.bsky.social
cs phd @upenn advised by Michael Kearns, Aaron Roth, and Duncan Watts| previously @stanford | she/her
https://psamathe50.github.io/sikatasengupta/
https://psamathe50.github.io/sikatasengupta/
Reposted by Sikata Sengupta
I think I posted about it before but never with a thread. We recently put a new preprint on arxiv.
📖 Replicable Reinforcement Learning with Linear Function Approximation
🔗 arxiv.org/abs/2509.08660
In this paper, we study formal replicability in RL with linear function approximation. The... (1/6)
📖 Replicable Reinforcement Learning with Linear Function Approximation
🔗 arxiv.org/abs/2509.08660
In this paper, we study formal replicability in RL with linear function approximation. The... (1/6)
Replicable Reinforcement Learning with Linear Function Approximation
Replication of experimental results has been a challenge faced by many scientific disciplines, including the field of machine learning. Recent work on the theory of machine learning has formalized rep...
arxiv.org
October 26, 2025 at 2:16 PM
I think I posted about it before but never with a thread. We recently put a new preprint on arxiv.
📖 Replicable Reinforcement Learning with Linear Function Approximation
🔗 arxiv.org/abs/2509.08660
In this paper, we study formal replicability in RL with linear function approximation. The... (1/6)
📖 Replicable Reinforcement Learning with Linear Function Approximation
🔗 arxiv.org/abs/2509.08660
In this paper, we study formal replicability in RL with linear function approximation. The... (1/6)
Come say hi at our #ICML poster today during Poster Session 1 (W-600)! Joint work with @ericeaton.bsky.social @marcelhussing.bsky.social @optimistsinc.bsky.social @aaroth.bsky.social @mkearnsphilly.bsky.social!
arxiv.org/abs/2502.11828
arxiv.org/abs/2502.11828
Intersectional Fairness in Reinforcement Learning with Large State and Constraint Spaces
In traditional reinforcement learning (RL), the learner aims to solve a single objective optimization problem: find the policy that maximizes expected reward. However, in many real-world settings, it ...
arxiv.org
July 15, 2025 at 2:38 PM
Come say hi at our #ICML poster today during Poster Session 1 (W-600)! Joint work with @ericeaton.bsky.social @marcelhussing.bsky.social @optimistsinc.bsky.social @aaroth.bsky.social @mkearnsphilly.bsky.social!
arxiv.org/abs/2502.11828
arxiv.org/abs/2502.11828
Reposted by Sikata Sengupta
hi bluesky 👋 I’m starting a blog! First post on how I use GenAI in my workflow as an academic. give it a read + tell me what you think:
yeganeha.substack.com/p/academic-p... #GenAI #Academia
yeganeha.substack.com/p/academic-p... #GenAI #Academia
Academic Productivity with GenAI: A Researcher’s Guide
My Everyday Use of GenAI as a Researcher
yeganeha.substack.com
June 9, 2025 at 3:45 PM
hi bluesky 👋 I’m starting a blog! First post on how I use GenAI in my workflow as an academic. give it a read + tell me what you think:
yeganeha.substack.com/p/academic-p... #GenAI #Academia
yeganeha.substack.com/p/academic-p... #GenAI #Academia
Reposted by Sikata Sengupta
Dhruv Rohatgi will be giving a lecture on our recent work on comp-stat tradeoffs in next-token prediction at the RL Theory virtual seminar series (rl-theory.bsky.social) tomorrow at 2pm EST! Should be a fun talk---come check it out!!
May 26, 2025 at 7:19 PM
Dhruv Rohatgi will be giving a lecture on our recent work on comp-stat tradeoffs in next-token prediction at the RL Theory virtual seminar series (rl-theory.bsky.social) tomorrow at 2pm EST! Should be a fun talk---come check it out!!
Reposted by Sikata Sengupta
Later today, Sikata and Marcel will talk about their recent work on oracle-efficient RL with ensembles. Join us!
May 20, 2025 at 3:48 PM
Later today, Sikata and Marcel will talk about their recent work on oracle-efficient RL with ensembles. Join us!
Reposted by Sikata Sengupta
Last seminars before the summer break:
04/29: Max Simchowitz (CMU)
05/06: Jeongyeol Kwon (Univ. of Widsconsin-Madison)
05/20: Sikata Sengupta & Marcel Hussing (Univ. of Pennsylvania)
05/27: Dhruv Rohatgi (MIT)
06/03: David Janz (Univ. of Oxford)
06/10: Nneka Okolo (MIT)
04/29: Max Simchowitz (CMU)
05/06: Jeongyeol Kwon (Univ. of Widsconsin-Madison)
05/20: Sikata Sengupta & Marcel Hussing (Univ. of Pennsylvania)
05/27: Dhruv Rohatgi (MIT)
06/03: David Janz (Univ. of Oxford)
06/10: Nneka Okolo (MIT)
April 16, 2025 at 5:20 PM
Last seminars before the summer break:
04/29: Max Simchowitz (CMU)
05/06: Jeongyeol Kwon (Univ. of Widsconsin-Madison)
05/20: Sikata Sengupta & Marcel Hussing (Univ. of Pennsylvania)
05/27: Dhruv Rohatgi (MIT)
06/03: David Janz (Univ. of Oxford)
06/10: Nneka Okolo (MIT)
04/29: Max Simchowitz (CMU)
05/06: Jeongyeol Kwon (Univ. of Widsconsin-Madison)
05/20: Sikata Sengupta & Marcel Hussing (Univ. of Pennsylvania)
05/27: Dhruv Rohatgi (MIT)
06/03: David Janz (Univ. of Oxford)
06/10: Nneka Okolo (MIT)
@mkearnsphilly.bsky.social is now on bsky as well!
If you are at #NeurIPS, we will be presenting this work (#6610) from 4:30-7:30PM today and would love to chat! @marcelhussing.bsky.social @optimistsinc.bsky.social @aaroth.bsky.social
Actual content post: Have not talked much about this work yet but we have a paper on Oracle-Efficient Reinforcement Learning for Max Value Ensembles at this year's #NeurIPS. We provide an efficient algorithm to ensemble policies given a value function oracle. arxiv.org/abs/2405.16739
December 12, 2024 at 5:51 PM
@mkearnsphilly.bsky.social is now on bsky as well!
If you are at #NeurIPS, we will be presenting this work (#6610) from 4:30-7:30PM today and would love to chat! @marcelhussing.bsky.social @optimistsinc.bsky.social @aaroth.bsky.social
Actual content post: Have not talked much about this work yet but we have a paper on Oracle-Efficient Reinforcement Learning for Max Value Ensembles at this year's #NeurIPS. We provide an efficient algorithm to ensemble policies given a value function oracle. arxiv.org/abs/2405.16739
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Reinforcement learning (RL) in large or infinite state spaces is notoriously challenging, both theoretically (where worst-case sample and computational complexities must scale with state space cardina...
arxiv.org
December 12, 2024 at 5:29 PM
If you are at #NeurIPS, we will be presenting this work (#6610) from 4:30-7:30PM today and would love to chat! @marcelhussing.bsky.social @optimistsinc.bsky.social @aaroth.bsky.social
Reposted by Sikata Sengupta
I made a starter pack for learning theory people to gather some people around the topic. There are too many names on here that I don't know so I only added a few I do. If you believe you should be on this list, let me know. I will add people with accurate profile descriptions.
go.bsky.app/21nFz12
go.bsky.app/21nFz12
November 10, 2024 at 6:08 PM
I made a starter pack for learning theory people to gather some people around the topic. There are too many names on here that I don't know so I only added a few I do. If you believe you should be on this list, let me know. I will add people with accurate profile descriptions.
go.bsky.app/21nFz12
go.bsky.app/21nFz12
Reposted by Sikata Sengupta
Actual content post: Have not talked much about this work yet but we have a paper on Oracle-Efficient Reinforcement Learning for Max Value Ensembles at this year's #NeurIPS. We provide an efficient algorithm to ensemble policies given a value function oracle. arxiv.org/abs/2405.16739
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Reinforcement learning (RL) in large or infinite state spaces is notoriously challenging, both theoretically (where worst-case sample and computational complexities must scale with state space cardina...
arxiv.org
November 10, 2024 at 5:51 PM
Actual content post: Have not talked much about this work yet but we have a paper on Oracle-Efficient Reinforcement Learning for Max Value Ensembles at this year's #NeurIPS. We provide an efficient algorithm to ensemble policies given a value function oracle. arxiv.org/abs/2405.16739