Ben Schneider
banner
bschneidr.bsky.social
Ben Schneider
@bschneidr.bsky.social
Stats, surveys, R, and dogs.
www.practicalsignificance.com
yeah uv is a real delight. I wish it worked for R too
November 19, 2025 at 3:02 AM
Unfortunately the methods literature hasn't really solved this problem yet AFAIK.

This very recent paper tried to extend the Hyndman&Fan quantile estimators to work with weights and also creates estimators that depend on the arbitrary sort order in the data.

academic.oup.com/jssam/advanc...
INFERENCE FOR THE QUANTILE RATIO INEQUALITY INDEX IN THE CONTEXT OF SURVEY DATA
Abstract. There exist many statistical indicators for measuring economic inequality. Most of them rely on distribution moments or focus on a few selected p
academic.oup.com
November 14, 2025 at 2:33 PM
Yeah, I think NSE is a huge reason why R can be more ergonomic. That aside, for a long time I thought I hated Python when it turned out I just hated pandas and matplotlib. I have gripes with polars but it’s so much better than pandas.
November 13, 2025 at 5:18 PM
😬 what a mess. And yeah, in the short term when there are weights it makes me want to just settle and use type 2
November 8, 2025 at 4:14 PM
The !! operator can stay as is, but when genzplyr is loaded it has to be pronounced “gang gang”
November 7, 2025 at 12:20 PM
Thanks Matt! That’s good to hear- I’ll update the post to note that
November 6, 2025 at 3:41 PM
Thanks for pointing that out, I’ll read up on it. It looks from the function documentation like it wouldn’t have the same reproducibility issue as the others: “the functions correctly combine weights of observations having duplicate values of x before computing estimates.”
November 6, 2025 at 12:41 PM
It should be fine if you’re doing interpolation of the ECDF that corresponds to the Hyndman and Fan types 1-3 (and if that works for your problem domain). It’s just tricky once you get into types 4-9, like these various R packages do.
November 6, 2025 at 12:57 AM
Please let me know if you find any clear docs about how SPSS computes weighted quantiles. I suspect SPSS only allows integer frequency weights, which is a much easier scenario than the general problem of arbitrary weights that the R packages are trying to deal with.
November 5, 2025 at 5:16 PM
Here's a simple R example. Two R packages yield different results from each other, and both R packages will surprisingly return different results depending on the arbitrary way that the data are sorted. So you can easily wind up getting a different estimate on Tuesday than you got on Monday.
November 5, 2025 at 4:36 PM