Nikhil Garg
nkgarg.bsky.social
Nikhil Garg
@nkgarg.bsky.social
I study algorithms/learning/data applied to democracy/markets/society. Asst. professor at Cornell Tech. https://gargnikhil.com/. Helping building personalized Bluesky research feed: https://bsky.app/profile/paper-feed.bsky.social/feed/preprintdigest
For example, the NYC Dept of Parks and Recreation no longer crowdsources requests to plant trees. Public data illustrates why: requests are highly correlated with neighborhood income, and *negatively* a measure of need, the heat vulnerability index
August 11, 2025 at 1:25 PM
New piece, out in the Sigecom Exchanges! It's my first solo-author piece, and the closest thing I've written to being my "manifesto." #econsky #ecsky
arxiv.org/abs/2507.03600
August 11, 2025 at 1:25 PM
Map from @thecity.nyc is much more detailed

www.thecity.nyc/2025/06/24/m...
June 25, 2025 at 2:11 AM
This NYT map illustrates the weirdness of looking just at first place votes. UWS is colored for Cuomo even though he is doing (slightly) worse there than city-wide, because Lander got a good chunk of first place votes
June 25, 2025 at 2:10 AM
Nothing sophisticated, but daily active users of the Paper Skygest went from around 900 to 1100 at the time of your post. Those two spikes in end April and early May are when your post took off, I think
June 22, 2025 at 5:56 PM
See very last tweet...I had privately thought it bad form to imply that it was fake without evidence, but the update seems to vindicate him.
May 16, 2025 at 4:40 PM
Good point, it's hard to tell from Figure 1 what the very recent absolute number trend is. @sjgreenwood.bsky.social at some point we should look at this with our data too!
May 15, 2025 at 12:59 PM
We end with recommendations for what the discrimination auditing and related literatures should do, when it needs to impute race/ethnicity. tl;dr: use continuous scores whenever possible! Basically never use argmax/thresholding.
February 27, 2025 at 7:43 PM
As an optimization problem, we formalize this problem of discretizing probabilities and its tradeoffs and find that a joint optimization approach can mitigate these distributional skews with negligible loss in individual prediction accuracy, while using *no additional data*.
February 27, 2025 at 7:43 PM
e.g., naive discretization methods underestimate downstream disparities: building on work by Argyle and Barber, we find discretization underestimates Black voter turnout in NC, with errors correlating with household income. A continuous estimator is unbiased.

www.cambridge.org/core/journal...
February 27, 2025 at 7:43 PM
These skews can affect downstream tasks, for example discrimination auditing or electoral outreach analyses that rely on either the counts, geographic distribution, or individual labels to be accurate or even unbiased.
February 27, 2025 at 7:43 PM
We show that the most common, “argmax” discretization method – taking the most likely category – skews the predicted population. In our empirical application, a real-world voter file undercounts the Black voter population by 28% in NC, with further geographic skews.
February 27, 2025 at 7:43 PM
Now online @pnasnexus.org! Many discrimination auditing and electoral tasks use ML to predict race/ethnicity – by discretizing continuous scores. Can the discretization process cause bias in labels and downstream tasks? Yes! Led by @evandyx.bsky.social

academic.oup.com/pnasnexus/ar...
February 27, 2025 at 7:43 PM
The entire chat is pretty long/about an ongoing paper we're writing, but here's the relevant portion.

The first paper (Manish's) is the relevant one. arxiv.org/abs/2412.08610. This chat occurred the day after it appeared on arxiv.

(it does get every detail wrong except title, author, id)
January 12, 2025 at 4:56 PM
Also check out Kenny's paper + poster on algorithmic monoculture in matching markets. We find that, with modeling market effects (e.g., applicants also have preferences over firms), intuitions about monoculture being bad for applicants become more nuanced.

Link: arxiv.org/abs/2312.09841
December 11, 2024 at 4:21 PM
This seems fantastic! "The primary objective was to address current challenges and advance the ongoing discourse on the evaluation of recommender systems. The participants’ diverse backgrounds and perspectives on evaluation significantly contributed to the discourse on this subject."
December 2, 2024 at 5:17 PM