Lightnews — Scholar-powered news

Ben Schneider

@bschneidr.bsky.social

730 followers 600 following 270 posts

Stats, surveys, R, and dogs.
www.practicalsignificance.com

Posts Replies Media Videos

Ben Schneider

@bschneidr.bsky.social

yeah uv is a real delight. I wish it worked for R too

November 19, 2025 at 3:02 AM

Ben Schneider

@bschneidr.bsky.social

Unfortunately the methods literature hasn't really solved this problem yet AFAIK.

This very recent paper tried to extend the Hyndman&Fan quantile estimators to work with weights and also creates estimators that depend on the arbitrary sort order in the data.

academic.oup.com/jssam/advanc...

INFERENCE FOR THE QUANTILE RATIO INEQUALITY INDEX IN THE CONTEXT OF SURVEY DATA

Abstract. There exist many statistical indicators for measuring economic inequality. Most of them rely on distribution moments or focus on a few selected p

academic.oup.com

November 14, 2025 at 2:33 PM

Ben Schneider

@bschneidr.bsky.social

Yeah, I think NSE is a huge reason why R can be more ergonomic. That aside, for a long time I thought I hated Python when it turned out I just hated pandas and matplotlib. I have gripes with polars but it’s so much better than pandas.

November 13, 2025 at 5:18 PM

Ben Schneider

@bschneidr.bsky.social

😬 what a mess. And yeah, in the short term when there are weights it makes me want to just settle and use type 2

November 8, 2025 at 4:14 PM

Ben Schneider

@bschneidr.bsky.social

The !! operator can stay as is, but when genzplyr is loaded it has to be pronounced “gang gang”

November 7, 2025 at 12:20 PM

Ben Schneider

@bschneidr.bsky.social

Thanks Matt! That’s good to hear- I’ll update the post to note that

November 6, 2025 at 3:41 PM

Ben Schneider

@bschneidr.bsky.social

Thanks for pointing that out, I’ll read up on it. It looks from the function documentation like it wouldn’t have the same reproducibility issue as the others: “the functions correctly combine weights of observations having duplicate values of x before computing estimates.”

November 6, 2025 at 12:41 PM

Ben Schneider

@bschneidr.bsky.social

It should be fine if you’re doing interpolation of the ECDF that corresponds to the Hyndman and Fan types 1-3 (and if that works for your problem domain). It’s just tricky once you get into types 4-9, like these various R packages do.

November 6, 2025 at 12:57 AM

Ben Schneider

@bschneidr.bsky.social

Please let me know if you find any clear docs about how SPSS computes weighted quantiles. I suspect SPSS only allows integer frequency weights, which is a much easier scenario than the general problem of arbitrary weights that the R packages are trying to deal with.

November 5, 2025 at 5:16 PM

Ben Schneider

@bschneidr.bsky.social

Here's a simple R example. Two R packages yield different results from each other, and both R packages will surprisingly return different results depending on the arbitrary way that the data are sorted. So you can easily wind up getting a different estimate on Tuesday than you got on Monday.

R code from the blog post, showing that weighted quantile functions return different results depending on the sort order of the data.

November 5, 2025 at 4:36 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news