#MDPs
Our final session at CPAIOR covers a variety of topics: learning, algorithm configuration, and MDPs. As a surprise speaker, Bistra Dilkina is presenting the work "Accelerated Discovery of Set Cover Solutions via Graph Neural Networks", as the authors were unavailable.
doi.org/10.1007/978-...
November 13, 2025 at 4:35 AM
Also I think it's very funny that I keep thinking of myself as an mDPS main because that's what I started as, when I had already switched to tanks by the next year.

It makes sense, I switched fully by Endwalker (Dec 2021) because dps queues sucked solo, but it feels insane to say 'Endwalker 4 years ago'.
November 8, 2025 at 4:15 PM
Chapters 1-7 of our forthcoming book “Markov Decision Processes and Reinforcement Learning” by Puterman and Chan can be downloaded at:
github.com/martyput/MDP....
They cover model foundations and MDPs. Please share comments and/or typos ASAP.
#orms #AI #ReinforcementLearning
September 25, 2025 at 5:06 PM
State Discretization for Continuous-State MDPs in Infectious Disease Control
https://arxiv.org/abs/2404.12540
October 8, 2024 at 3:46 PM
Discover how Anc-VI converges to fixed points in undiscounted MDPs (γ = 1), addressing challenges typically overlooked in traditional DP and RL theory. #reinforcementlearning
Why Anc-VI is Crucial for Undiscounted Reinforcement Learning
hackernoon.com
January 14, 2025 at 10:56 PM
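For readers curious what the anchoring trick looks like in practice, here is a minimal sketch of anchored value iteration on a made-up two-state MDP (all numbers are invented for illustration, and the anchor schedule β_k = 1/(k+1) is a common simple choice, not necessarily the paper's exact one). Each iterate is pulled back toward the starting point V0 with vanishing weight, which is what yields convergence guarantees even as γ → 1:

```python
import numpy as np

# Hypothetical 2-state, 2-action MDP (numbers invented for illustration).
# P[a, s, s'] is a transition probability, R[a, s] an expected reward.
P = np.array([
    [[0.9, 0.1], [0.2, 0.8]],   # action 0
    [[0.5, 0.5], [0.7, 0.3]],   # action 1
])
R = np.array([
    [1.0, 0.0],   # action 0
    [0.5, 2.0],   # action 1
])
gamma = 0.99  # anchoring matters most as gamma approaches 1

def bellman(V):
    """Optimal Bellman operator T: max_a [R(s,a) + gamma * sum_s' P(s'|s,a) V(s')]."""
    return np.max(R + gamma * (P @ V), axis=0)

def anchored_vi(V0, iters=20000):
    # Anchored VI: V_k = beta_k * V0 + (1 - beta_k) * T(V_{k-1}),
    # with anchor weight beta_k = 1/(k+1) shrinking to zero.
    V = V0.copy()
    for k in range(1, iters + 1):
        beta = 1.0 / (k + 1)
        V = beta * V0 + (1.0 - beta) * bellman(V)
    return V

V = anchored_vi(np.zeros(2))
print(np.max(np.abs(bellman(V) - V)))  # Bellman residual, shrinks as iterations grow
```

Plain value iteration is just the `beta = 0` case; the anchor term is what the residual analysis for undiscounted problems leans on.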
Antoine Moulin, Gergely Neu, Luca Viano: Inverse Q-Learning Done Right: Offline Imitation Learning in $Q^\pi$-Realizable MDPs https://arxiv.org/abs/2505.19946 https://arxiv.org/pdf/2505.19946 https://arxiv.org/html/2505.19946
May 27, 2025 at 6:56 AM
Changli MDPS is insanely fun

https://www.playing-games.com/719611/

Somehow, using Changli as a HyperCarry MDPS instead of the usual QuickSwap style makes it even more fun lmaoooo by K-Rie7
July 14, 2025 at 6:30 AM
arXiv:2504.15960v1 Announce Type: new
Abstract: We consider multiple-environment Markov decision processes (MEMDP), which consist of a finite set of MDPs over the same state space, representing different scenarios of transition structure and [1/5 of https://arxiv.org/abs/2504.15960v1]
April 23, 2025 at 5:58 AM
that can be incremented or decremented during each state transition. VASS MDPs can be used as abstractions of probabilistic programs with many decidable properties. In this paper, we develop techniques for analyzing the asymptotic behavior of VASS [2/6 of https://arxiv.org/abs/2503.05006v1]
March 10, 2025 at 5:56 AM
shows that it is a generalisation of Thompson sampling from multi-arm bandit problems. Finally, we evaluate our framework on the Deep Sea benchmark problem and demonstrate the exploration benefits of posterior sampling in MDPs. [8/8 of https://arxiv.org/abs/2505.01859v1]
May 6, 2025 at 6:18 AM
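The multi-armed-bandit special case the thread refers to is easy to sketch. Below is a textbook Beta-Bernoulli Thompson sampling loop (arm means are invented for illustration; this is the standard bandit scheme the paper generalises, not the paper's own MDP framework): sample one mean per arm from its posterior, play the argmax, then update that arm's posterior with the observed reward.

```python
import random

# Bernoulli bandit with hypothetical arm means (for illustration only).
true_means = [0.3, 0.5, 0.7]
n_arms = len(true_means)

# Beta(1, 1) uniform priors over each arm's success probability.
alpha = [1.0] * n_arms
beta = [1.0] * n_arms

random.seed(0)
pulls = [0] * n_arms
for t in range(5000):
    # Thompson sampling: one posterior sample per arm, play the best sample.
    samples = [random.betavariate(alpha[i], beta[i]) for i in range(n_arms)]
    arm = max(range(n_arms), key=lambda i: samples[i])
    reward = 1 if random.random() < true_means[arm] else 0
    # Conjugate posterior update for the played arm.
    alpha[arm] += reward
    beta[arm] += 1 - reward
    pulls[arm] += 1

print(pulls)  # the best arm (index 2) should receive most pulls
```

Posterior sampling for MDPs replaces "sample an arm mean" with "sample a whole MDP from the posterior and act optimally in it", which is the generalisation the abstract describes.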
Now that I've finally finished leveling all the jobs

Favorite tank: Warrior
Least fav: Dark Knight

Favorite healer: Scholar
Least fav: Astrologian

Favorite mDps: Monk
Least fav: Samurai

Favorite ranged: Dancer
Least fav: Machinist

Favorite caster: Red Mage
Least fav: Black Mage
October 17, 2025 at 11:19 AM
Most older Hyundai and Kia cars (roughly 7+ years old) have this issue with their MDPS.
If this rubber part in the MDPS motor has broken, it can cause a steering-lock problem and trouble on the road.
If your steering wheel makes noise, you should have it checked at a skilled garage.
July 7, 2023 at 4:02 AM
Chapters 2, 4, 5, and 6 of Puterman’s book for the basics on MDPs?
November 25, 2024 at 6:20 PM
In arxiv.org/abs/2404.07266, Vahid shows how to use offline expert data with unobserved confounding, via a nonparametric prior, to guide learning policies for bandits, MDPs, and POMDPs.

Thu 12 Dec 4:30 - 7:30 pm PST 📷 West Ballroom A-D Poster #6708

🧵(3/7)
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
We study the problem of online sequential decision-making given auxiliary demonstrations from experts who made their decisions based on unobserved contextual information. These demonstrations can be v...
arxiv.org
December 11, 2024 at 12:20 AM
So happy with my team built around MDPS Xilonen <3 #GenshinImpact
October 20, 2024 at 12:10 PM
Ankita Kushwaha, Kiran Ravish, Preeti Lamba, Pawan Kumar: A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety https://arxiv.org/abs/2505.17342 https://arxiv.org/pdf/2505.17342 https://arxiv.org/html/2505.17342
May 26, 2025 at 6:02 AM
Krishnendu Chatterjee, Laurent Doyen, Jean-François Raskin, Ocan Sankur: The Value Problem for Multiple-Environment MDPs with Parity Objective https://arxiv.org/abs/2504.15960 https://arxiv.org/pdf/2504.15960 https://arxiv.org/html/2504.15960
April 23, 2025 at 5:58 AM
I came across a couple of other definitions that might be helpful to mention (apologies if you’re already considering these).
The first one is from Csaba Szepesvári’s RL theory lecture notes (lecture 2, planning in MDPs), and the second one is from Puterman's MDP book (chapter 1).
August 4, 2025 at 9:45 AM
🔄 Updated Arxiv Paper

Title: Constrained Average-Reward Intermittently Observable MDPs
Authors: Konstantin Avrachenkov, Madhu Dhiman, Veeraruna Kavitha

Read more: https://arxiv.org/abs/2504.13823
September 9, 2025 at 8:16 AM
Domain: mdps-oy.fi
Registered: 2025-09-10 17:31
Registrant: MDPS-Oy
Country: Finland
Registrar: Marcaria.com LLC
September 10, 2025 at 10:11 PM
Genshin Impact Teams - "Luna 2" (Nov 2025)

I haven't gotten anyone new, but seeing the upcoming Venti gameplay buffs and the Hexenzirkel buffs inspired me to put together a team centred around MDPS Venti.

Also replaced him with Sucrose in my Electro-Charged team.
November 13, 2025 at 7:18 AM
Have people in the #reinforcementlearning community been thinking about using LLMs to generate MDPs?
November 24, 2024 at 12:54 PM