These findings challenge assumptions in our field: that numeric ratings function as uniformly interpretable indicators of suicidal thinking.
Mid scale ratings often reflect qualitatively different experiences, not merely different intensities.
These findings challenge assumptions in our field: that numeric ratings function as uniformly interpretable indicators of suicidal thinking.
Mid scale ratings often reflect qualitatively different experiences, not merely different intensities.
What we found:
1️⃣ Shared meaning emerged only at the ends of the scale.
2️⃣ The middle of the scale was a conceptual gray zone.
3️⃣ Between person consistency never exceeded ~ 20%.
4️⃣ Within person consistency was strongest at scale endpoints.
5️⃣LLM refinement outperformed BERTopic alone.
What we found:
1️⃣ Shared meaning emerged only at the ends of the scale.
2️⃣ The middle of the scale was a conceptual gray zone.
3️⃣ Between person consistency never exceeded ~ 20%.
4️⃣ Within person consistency was strongest at scale endpoints.
5️⃣LLM refinement outperformed BERTopic alone.
Using a 2 stage NLP pipeline (BERTopic -> LLM refinement), we extracted coherent themes from thousands of participant responses and mapped them onto the numeric rating scale.
We also looked at within person consistency (across time) and between person consistency (across people).
Using a 2 stage NLP pipeline (BERTopic -> LLM refinement), we extracted coherent themes from thousands of participant responses and mapped them onto the numeric rating scale.
We also looked at within person consistency (across time) and between person consistency (across people).
Across 2 independent cohorts of adolescents and young adults, participants were randomly assigned number ratings (0-10) of suicide urge and provided open ended descriptions of what thoughts they would be having at those ratings.
Across 2 independent cohorts of adolescents and young adults, participants were randomly assigned number ratings (0-10) of suicide urge and provided open ended descriptions of what thoughts they would be having at those ratings.
We usually treat numeric ratings as if they directly reflect suicide risk. But psychological measurement tells us something trickier:
People map internal states onto numbers in highly personal ways.
We usually treat numeric ratings as if they directly reflect suicide risk. But psychological measurement tells us something trickier:
People map internal states onto numbers in highly personal ways.