Stefano Palminteri
@stepalminteri.bsky.social
1.4K followers 450 following 140 posts
Computational cognitive scientist interested in learning and decision-making in humans and machines. Research director of the Human Reinforcement Learning team, Ecole Normale Supérieure (ENS), Institut National de la Santé et Recherche Médicale (INSERM)
Pinned
stepalminteri.bsky.social
New paper out in @pnas.org, led by @isabellehoxha.bsky.social with Léo Sperber. We use evolutionary simulation to assess and compare the adaptive value of positivity bias and gradual perseveration in reinforcement learning. Follow the thread below (and Isabelle!) for more details!
isabellehoxha.bsky.social
Ever wondered why you keep going to that restaurant with stale fries? Is it because you went often in the past (perseveration) or because you remember past good experiences better (positivity bias)? Our study out in PNAS investigates the normative basis for these biases www.pnas.org/doi/10.1073/...
Evolving choice hysteresis in reinforcement learning: Comparing the adaptive value of positivity bias and gradual perseveration | PNAS
The tendency to repeat past choices more often than expected from the history of outcomes has been repeatedly empirically observed in reinforcement...
www.pnas.org
stepalminteri.bsky.social
I guess the missing link here is "However, we find that even if the agent updates its belief via, arguably objective, Bayesian inference, fitting the above model demonstrates both the biases". I am working under the assumption that the Bayes solution is understood as normative given the task here.
stepalminteri.bsky.social
If you want to know more about the reinforcement learning biases framework, I summarised it here:

www.researchgate.net/publication/...
stepalminteri.bsky.social
I am very humbled that over the past years so many smart people have taken our research questions and results seriously to push our understanding forward.
On the specific subject matter (bias or optimal) I am still persuaded that it is a bias, that just happens to be generally optimal 😉
Reposted by Stefano Palminteri
stepalminteri.bsky.social
Thought experiments such as the Blockhead and Super-Super Spartans are often taken as “definitive” arguments against behavior-based inference of cognitive processes.
In our review -with @thecharleywu.bsky.social- we argue they may not be as definitive as originally thought.
Reposted by Stefano Palminteri
anllohernan.bsky.social
I haven't given any news in a while; I've been nose deep into this novel preprint with my excellent collaborators @stepalminteri.bsky.social, @urihertz.bsky.social and Bahador Bahrami: "Uncovering the semantics of teaching in experiential learning with Large Language Models".
doi.org/10.31234/osf...
OSF
doi.org
Reposted by Stefano Palminteri
stepalminteri.bsky.social
New (revised) preprint with @thecharleywu.bsky.social
We rethink how to assess machine consciousness: not by code or circuitry, but by behavioral inference—as in cognitive science.
Extraordinary claims still need extraordinary evidence.
👉 osf.io/preprints/ps...
#AI #Consciousness #LLM
Reposted by Stefano Palminteri
stepalminteri.bsky.social
This book by @anilananth.bsky.social is great — perfect for those, like me, who have an intuitive and geometric grasp of math but unfortunately no formal training. Highly recommended!
Reposted by Stefano Palminteri
stepalminteri.bsky.social
Preprint alert! Navigating Inflationary and Deflationary Claims Concerning Large Language Models Avoiding Cognitive Biases.
Very fun and efficient collaboration with @giadapistilli.com
To help cognitively bounded humans balance hype and dismissal of LLMs' capabilities
osf.io/preprints/ps...
OSF
osf.io
stepalminteri.bsky.social
Check out @bcdavidson.bsky.social's preprint (w/ @georgiaturner.bsky.social @orbenamy.bsky.social @livia-tomova.bsky.social and co.) about the (computational) consequences of social isolation in social media use during covid!
bcdavidson.bsky.social
🚨 New Preprint 🚨

Prolonged Isolation is associated with an increased behavioural sensitivity to ‘Likes’ on social media.

🧵

Social media rewards are inherently social—but does posting change during social isolation, when in-person social rewards are limited?

It turns out, yes!
stepalminteri.bsky.social
Braitenberg's Vehicles arrived yesterday and I'm already halfway through it. An amazingly funny, clear, and lucid treatment of the question of attributing higher cognitive functions to artificial systems. Obviously very timely for current debates in AI
stepalminteri.bsky.social
This is the link to the previous study that served as the basis for our recent @pnas.org study on the optimality of choice-confirmation bias and perseveration.

"Choice-Confirmation Bias and Gradual Perseveration in Human Reinforcement Learning"

Open here:
www.researchgate.net/publication/...
Reposted by Stefano Palminteri
sjblakemore.bsky.social
New paper! By @livia-tomova.bsky.social, @emilyanntowner.bsky.social, Kirsten Thomas, @stepalminteri.bsky.social, @l32zhang.bsky.social

Acute isolation is associated with increased reward seeking and reward learning in human adolescents.

www.nature.com/articles/s44...