banner
anaymehrotra.bsky.social
@anaymehrotra.bsky.social
PhD candidate @ Yale | Undergrad @ IITK | anaymehrotra.com

Learning Theory, Missing Data, Generation
📣 Excited to announce the Reliable ML workshop at neuripsconf.bsky.social‬ 2025!

How do we build trustworthy models under distribution shift, adversarial attacks, strategic behavior, and missing data?

→ Submission tracks: long (9 pg) and short (4 pg)
→ Deadline: Aug 22, 2025 (AOE)
Reliable ML from Unreliable Data — NeurIPS 2025 Workshop
ReliableMLWorkshop.github.io
July 28, 2025 at 5:13 PM
Slides 🪧 from our language generation tutorial are now up!

Check them out at languagegeneration.github.io

Recorded sessions coming – meanwhile also check out Jon's invited talk at ICML – icml.cc/virtual/2025... !
July 15, 2025 at 8:10 PM
If you are at COLT, join us for a tutorial on Language Generation on the first day!

The tutorial dives into Kleinberg and Mullainathan’s “generation in the limit” framework and the exciting space of works building on it.

🕤 9:30 AM–12:00 PM | Room C
🔗 languagegeneration.github.io
June 29, 2025 at 2:37 PM
Reposted
📣Join us at COLT 2025 in Lyon for a community event!
📅When: Mon, June 30 | 16:00 CET
What: Fireside chat w/ Peter Bartlett & Vitaly Feldman on communicating a research agenda, followed by mentorship roundtable to practice elevator pitches & mingle w/ COLT community!
let-all.com/colt25.html
June 24, 2025 at 6:22 PM
We are organizing a Language Generation tutorial @ #COLT 2025!

Visit our website (languagegeneration.github.io/) for references and materials; content updated regularly, check back for the latest!

Coorganizers: Moses Charikar, Chirag Pabbaraju, Charlotte Peale, Grigoris Velegkas

See you in Lyon!
June 11, 2025 at 10:20 PM
Reposted
The tutorials, workshops, and community events for #COLT2025 have been announced!

Exciting topics, and impressive slate of speakers and events, on June 30! The workshops have calls for contributions (⏰ May 16, 19, and 25): check them out!
learningtheory.org/colt2025/ind...
May 10, 2025 at 1:51 AM
@felix-zhou-cfz.bsky.social is giving two talks about this work at @uwaterloo.ca – one in the A&C seminar (May 14th), followed by a proof overview in the student seminar (May 15th)!
New paper w/ @felix-zhou-cfz.bsky.social & Alkis Kalavasis!

Result: Vanilla SGD (w/ warm start) solves regression with unknown-index self-selection bias

Our method speeds up earlier algorithms by Y. Cherapanamjeri, C. Daskalakis, @aifi.bsky.social, M. Zampetakis, J. Gaitonde, & E. Mossel
May 9, 2025 at 8:14 PM
New paper w/ @felix-zhou-cfz.bsky.social & Alkis Kalavasis!

Result: Vanilla SGD (w/ warm start) solves regression with unknown-index self-selection bias

Our method speeds up earlier algorithms by Y. Cherapanamjeri, C. Daskalakis, @aifi.bsky.social, M. Zampetakis, J. Gaitonde, & E. Mossel
May 9, 2025 at 8:11 PM
Reposted
This looks exciting! arxiv.org/abs/2504.160...
by Xi Chen, Shyamal Patel, and Rocco Servedio.

An exp(k^1/3)-query adaptive algo for tolerant testing of k-juntas ("is a Boolean function on n variables close from depending on only k variables?"), via a connection to agnostic learning conjunctions.
A Mysterious Connection Between Tolerant Junta Testing and Agnostically Learning Conjunctions
The main conceptual contribution of this paper is identifying a previously unnoticed connection between two central problems in computational learning theory and property testing: agnostically learnin...
arxiv.org
April 23, 2025 at 9:28 PM
Excellent talk by Jon Kleinberg at the institute for advanced studies on language generation—an exciting new area initiated by Jon and @sendhil.bsky.social, with contributors from many institutions (list below)

Link: www.youtube.com/watch?v=zlyr...
Language Generation in the Limit - Jon Kleinberg
YouTube video by Institute for Advanced Study
www.youtube.com
April 21, 2025 at 11:53 PM
Reposted
"What makes a good fisherman as opposed to other professions?"
This question can be formulated as a k-linear regression problem with self-selection bias.

Alkis, @anaymehrotra.bsky.social, and I design faster local convergence algorithms for this problem:
arxiv.org/abs/2504.07133

(1/7)
Can SGD Select Good Fishermen? Local Convergence under Self-Selection Biases and Beyond
We revisit the problem of estimating $k$ linear regressors with self-selection bias in $d$ dimensions with the maximum selection criterion, as introduced by Cherapanamjeri, Daskalakis, Ilyas, and Zamp...
arxiv.org
April 19, 2025 at 5:39 PM
Reposted
STOC Theory Fest in Prague June 23-27.

Registration now open. Early deadline is May 6.
acm-stoc.org/stoc202...

You can apply for student support. Deadline April 27.
acm-stoc.org/stoc202...
April 14, 2025 at 9:58 PM
Reposted
Taking a break from the submission season? Swing by the Workshop on Algorithms for Large Data (Online), WALDO 2025 🗓️ April 14—16: waldo-workshop.github.io/2025.html
Registration is free! (but necessary by April 7)
Workshop on Algorithms for Large Data (Online) 2025
waldo-workshop.github.io
April 4, 2025 at 6:44 AM
Reposted
I'm a fan of this post!
Accessible TeX colors
Ewin's website
ewintang.com
March 6, 2025 at 3:26 AM
Reposted
A reminder about NY Theory Day in a week! Fri Dec 6th! Talks by Amir Abboud, Sanjeev Khanna, Rotem Oshman, and Ron Rothblum! At NYU Tandon!

sites.google.com/view/nyctheo...

Registration is free, but please register for building access.

See you all there!
Home
About The New York Theory Day is a workshop aimed to bring together the theoretical computer science community in the New York metropolitan area for a day of interaction and discussion. The Theory Da...
sites.google.com
November 30, 2024 at 5:04 PM
We want language models that do not hallucinate

We want language models that have breadth (i.e., no mode-collapse)

Jon [email protected] asked: Can we get both?

Alkis Kalavasis, Grigoris Velegkas, and I show this is impossible: arxiv.org/abs/2411.09642

🧵(1/3)
November 25, 2024 at 7:29 PM
Reposted
I wrote a Part IV postscript to my job market blog post to add what I've learned as faculty.

TL;DR: No one is out to get you. For anything not going your way, it's probably due to people being busy or bureaucracy. And there are probably people working very hard for you behind the scenes regardless.
The Job Market (Parts I, II, III, & IV)
www.kiragoldner.com
November 19, 2024 at 2:19 PM
Reposted
A list of all the stats/modeling/ML/data starter packs I've seen (26+ and counting):
November 23, 2024 at 7:57 PM
Reposted
Optimist: The cup is half full
Pessimist: The cup is half empty
LaTeX user: The whole spacing and size are wrong, you should have been using \bigcup
Optimist: The cup is half full
Pessimist: The cup is half empty
STS researcher: The cup is an affordance
Optimist: The cup is half full
Pessimist: The cup is half empty
Cosmologist: The cup is dusty or full of gravitational waves
November 23, 2024 at 8:49 PM
Reposted
It's tough to gain visibility as a young researcher, and it's job market season! Are you a theoretical computer science PhD/postdoc on the job market?

I don't have a crazy juge audience but I'll try to help a bit: fill this form, and I'll tweet your pitch and info!
docs.google.com/forms/d/e/1F...
Theoretical CS Job Market 2024
docs.google.com
November 23, 2024 at 11:56 PM