Sikata Sengupta
sikatasengupta.bsky.social
Sikata Sengupta
@sikatasengupta.bsky.social
cs phd @upenn advised by Michael Kearns, Aaron Roth, and Duncan Watts| previously @stanford | she/her
https://psamathe50.github.io/sikatasengupta/
Reposted by Sikata Sengupta
I think I posted about it before but never with a thread. We recently put a new preprint on arxiv.

📖 Replicable Reinforcement Learning with Linear Function Approximation

🔗 arxiv.org/abs/2509.08660

In this paper, we study formal replicability in RL with linear function approximation. The... (1/6)
Replicable Reinforcement Learning with Linear Function Approximation
Replication of experimental results has been a challenge faced by many scientific disciplines, including the field of machine learning. Recent work on the theory of machine learning has formalized rep...
arxiv.org
October 26, 2025 at 2:16 PM
Reposted by Sikata Sengupta
hi bluesky 👋 I’m starting a blog! First post on how I use GenAI in my workflow as an academic. give it a read + tell me what you think:
yeganeha.substack.com/p/academic-p... #GenAI #Academia
Academic Productivity with GenAI: A Researcher’s Guide
My Everyday Use of GenAI as a Researcher
yeganeha.substack.com
June 9, 2025 at 3:45 PM
Reposted by Sikata Sengupta
Dhruv Rohatgi will be giving a lecture on our recent work on comp-stat tradeoffs in next-token prediction at the RL Theory virtual seminar series (rl-theory.bsky.social) tomorrow at 2pm EST! Should be a fun talk---come check it out!!
May 26, 2025 at 7:19 PM
Reposted by Sikata Sengupta
Later today, Sikata and Marcel will talk about their recent work on oracle-efficient RL with ensembles. Join us!
May 20, 2025 at 3:48 PM
Reposted by Sikata Sengupta
Last seminars before the summer break:

04/29: Max Simchowitz (CMU)
05/06: Jeongyeol Kwon (Univ. of Widsconsin-Madison)
05/20: Sikata Sengupta & Marcel Hussing (Univ. of Pennsylvania)
05/27: Dhruv Rohatgi (MIT)
06/03: David Janz (Univ. of Oxford)
06/10: Nneka Okolo (MIT)
April 16, 2025 at 5:20 PM
@mkearnsphilly.bsky.social is now on bsky as well!
If you are at #NeurIPS, we will be presenting this work (#6610) from 4:30-7:30PM today and would love to chat! @marcelhussing.bsky.social @optimistsinc.bsky.social @aaroth.bsky.social
Actual content post: Have not talked much about this work yet but we have a paper on Oracle-Efficient Reinforcement Learning for Max Value Ensembles at this year's #NeurIPS. We provide an efficient algorithm to ensemble policies given a value function oracle. arxiv.org/abs/2405.16739
December 12, 2024 at 5:51 PM
If you are at #NeurIPS, we will be presenting this work (#6610) from 4:30-7:30PM today and would love to chat! @marcelhussing.bsky.social @optimistsinc.bsky.social @aaroth.bsky.social
Actual content post: Have not talked much about this work yet but we have a paper on Oracle-Efficient Reinforcement Learning for Max Value Ensembles at this year's #NeurIPS. We provide an efficient algorithm to ensemble policies given a value function oracle. arxiv.org/abs/2405.16739
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Reinforcement learning (RL) in large or infinite state spaces is notoriously challenging, both theoretically (where worst-case sample and computational complexities must scale with state space cardina...
arxiv.org
December 12, 2024 at 5:29 PM
Reposted by Sikata Sengupta
I made a starter pack for learning theory people to gather some people around the topic. There are too many names on here that I don't know so I only added a few I do. If you believe you should be on this list, let me know. I will add people with accurate profile descriptions.

go.bsky.app/21nFz12
November 10, 2024 at 6:08 PM
Reposted by Sikata Sengupta
Actual content post: Have not talked much about this work yet but we have a paper on Oracle-Efficient Reinforcement Learning for Max Value Ensembles at this year's #NeurIPS. We provide an efficient algorithm to ensemble policies given a value function oracle. arxiv.org/abs/2405.16739
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Reinforcement learning (RL) in large or infinite state spaces is notoriously challenging, both theoretically (where worst-case sample and computational complexities must scale with state space cardina...
arxiv.org
November 10, 2024 at 5:51 PM