Maria Antoniak
@mariaa.bsky.social
☀️ Assistant Professor of Computer Science at CU Boulder 👩‍💻 NLP, cultural analytics, narratives, online communities 🌐 https://maria-antoniak.github.io 💬 books, bikes, games, art
Pinned
mariaa.bsky.social
some little bluesky tips 🦋

your blocks, likes, lists, and just about everything except chats are PUBLIC

you can pin custom feeds; i like quiet posters, best of follows, mutuals, mentions

if your chronological feed is overwhelming, you can make and pin a personal list of "unmissable" people
Reposted by Maria Antoniak
bayesianboy.bsky.social
What problem is explainability/interpretability research trying to solve in ML, and do you have a favorite paper articulating what that problem is?
mariaa.bsky.social
"come on pleeeease we only want to persuade you of good things! please please just let us persuade you a little bit! we promise only for GOOD things! come on!"
mariaa.bsky.social
"nlp for social good"
unenthusiast.com
In honour of spooky month, share a 4 word horror story that only someone in your profession would understand.

rm -rf ~/
hammancheez.bsky.social
"The chancellor approved it"
Reposted by Maria Antoniak
What does an LLM do when it translates from Italian "amore" to Spanish "amor" or French "amour"?

That's easy! (you might think) Because surely it knows: amore, amor, amour are all based on the same Latin word. It can just drop the "e", or add a "u".
Reposted by Maria Antoniak
cmyeaton.bsky.social
For the 2nd week in a row, the federal shutdown has blocked disease surveillance reporting. So Team Force of Infection again visited all 50 state websites to track COVID, flu & RSV. caitlinrivers.substack.com/p/outbreak-o...
Outbreak Outlook: Week 2 of DIY Surveillance
State by state disease surveillance of COVID-19, influenza and RSV amid federal public health cuts
caitlinrivers.substack.com
Reposted by Maria Antoniak
informor.bsky.social
This post violates my pet peeve -- don't list a date without a day-of-week if you want people to sign up -- but this protest is the most important thing you can do this coming SATURDAY.

You've posted on Bluesky enough. We need everyone on the streets. #nokings
mariaa.bsky.social
The #COLM2025 workshop on NLP4Democracy is starting now! Join us in 520E.

I’ll be speaking at 10:15am with @ysiglidis.bsky.social about work with @iaugenstein.bsky.social and @serge.belongie.com focused on tracking collective narratives on social media.
A slide that reads “NLP and all the interesting and weird ways it intersects with processes and values that comprise democracy”
Reposted by Maria Antoniak
sunniesuhyoung.bsky.social
Our Responsible AI team at Apple is looking for spring/summer 2026 PhD research interns! Please apply at jobs.apple.com/en-us/detail... and email [email protected]. Do not send extra info (e.g., CV), just drop us a line so we can find your application in the central pool!
Machine Learning / AI Internships - Jobs - Careers at Apple
Apply for a Machine Learning / AI Internships job at Apple. Read about the role and find out if it’s right for you.
jobs.apple.com
mariaa.bsky.social
Closing session for #COLM2025!

There will be #COLM2026! @yoavartzi.com and @gregdnlp.bsky.social will be organizing. Location TBD.

Full day of workshops tomorrow, check the program.
Reposted by Maria Antoniak
dmimno.bsky.social
He had physical copies of at least six books at the podium and read passages from them. His main point was that regardless of whether you're more concerned about Big Risks or current harms, the "everything is great" position is untenable for anyone.
Reposted by Maria Antoniak
thomasdavidson.bsky.social
There is one week left to apply to join us at Rutgers! We're hiring an Assistant Professor in Computational Sociology as part of a cluster of new hires in data science and AI.

Applications are due next Wednesday, 10/15.
Assistant Professor in Computational Sociology
The Department of Sociology at Rutgers University, New Brunswick, seeks applications for a tenure-track position at the Assistant Professor level specializing in Computational Sociology.  The search i...
jobs.rutgers.edu
mariaa.bsky.social
Another oral talk, this one by @stellali.bsky.social, discussing "PrefPalette: Personalized Preference Modeling with Latent Attributes."

Communities prefer different kinds of responses, which prioritize specific values. Aggregating preferences over communities would lose that signal.

#COLM2025
PrefPalette: Personalized Preference Modeling with Latent Attributes
Personalizing AI systems requires understanding not just what users prefer, but the reasons that underlie those preferences - yet current preference models typically treat human judgment as a black bo...
arxiv.org
mariaa.bsky.social
At the end of the talk, he said he couldn't provide a full answer to that question but that we should each consider it. He said that he thinks there are potential upsides that he finds worth it, and that's why he's working on LMs. But he's not sure.

(paraphrasing a little bit from memory)
mariaa.bsky.social
Now we're hearing a talk by @valentinhofmann.bsky.social about "Fluid Language Model Benchmarking."

Computerized adaptive testing is used for humans (like the GRE). This work adopts Item Response Theory from education to measure benchmark characteristics.

#COLM2025
Fluid Language Model Benchmarking
Language model (LM) benchmarking faces several challenges: comprehensive evaluations are costly, benchmarks often fail to measure the intended capabilities, and evaluation quality can degrade due to l...
arxiv.org
mariaa.bsky.social
Question from the audience (didn't catch the name): Why are computer scientists the ones who should solve this problem? They lack expertise, and there are other people who have been studying these kinds of harms for a very long time.

#COLM2025
mariaa.bsky.social
"What problems you're scared of depend on how good you think the LLMs will get"

"Please be willing to change your mind."

"This is COLM. We made the models, it's our job to fix it. How are you going to change your research agenda?"

#COLM2025
mariaa.bsky.social
"I'm not arguing that everyone should work on all of these problems, I don't know how to work on this problem. But we should work on them scientifically and on a spectrum of problems."

Argues that these different kinds of risks are NOT just distractions from each other.

#COLM2025
mariaa.bsky.social
Final risk is misalignment. (NB: You-know-who again. I'm not going to bother advertising.)

Nicholas expresses some skepticism but also asks that we don't immediately dismiss these risks.

"That's something that only happens in scifi... well then we live in scifi."

#COLM2025
mariaa.bsky.social
Another example of misuse: dangerous capabilities (bio weapons). Both OpenAI and Anthropic have expensive safeguards that run on every single query to look for these kinds of dangers

Are bio weapons the most important risk? Maybe, maybe not

#COLM2025
mariaa.bsky.social
Another example of misuse: mass surveillance, "with language models you have the potential to watch everyone"

Quotes Larry Ellison on how "citizens will be on their best behavior" and notes that Oracle recently invested billions in OpenAI

#COLM2025
mariaa.bsky.social
Misuse: Shows a benchmark showing increasing success of models at identifying vulnerabilities, and discusses ransomware at scale (they ran simulations on the Enron email dataset, and Claude automatically found someone having an affair). The first real example of this was recently discovered

#COLM2025
mariaa.bsky.social
Now job replacement. "Things might be fine in 30 years but the next 20 years could be hard."

"Sometimes people claim that this worry is just something people say to sell stuff, but Dario has proposed taxing tokens (?) and offering to lose money is a sign people really believe this"

#COLM2025
mariaa.bsky.social
Now discussing misinformation and Elon Musk.

"Previously no one person had so much power over information. If you control the language model, you have a significant amount of power about how people see the world."

#COLM2025