Michelle L. Ding
@michelleding.bsky.social
93 followers 58 following 22 posts
organizer/researcher critically investigating how AI systems impact communities. cs phd @ brown cntr. she/her. 🌷 https://michelle-ding.github.io/
Posts Media Videos Starter Packs
Pinned
michelleding.bsky.social
Hi #COLM2025! 🇨🇦 I will be presenting a talk on the importance of community-driven LLM evaluations based on an opinion abstract I wrote with Jo Kavishe, @victorojewale.bsky.social and @geomblog.bsky.social tomorrow at 9:30am in 524b for solar-colm.github.io

Hope to see you there!
Third Workshop on Socially Responsible Language Modelling Research (SoLaR) 2025
COLM 2025 in-person Workshop, October 10th at the Palais des Congrès in Montreal, Canada
solar-colm.github.io
michelleding.bsky.social
More resources here: bit.ly/solar-cntr 🌱 Happy to chat with anyone interested in learning more!
Grey background with black text that says:

Slide 1: What can you do as a researcher?
Look into your local community - develop sustainable, long term partnerships with city-level or state-level coalitions, networks, news agencies, or non-profit organizations that do on the ground work to discover harms
University researchers – look beyond your department into other disciplines in the humanities and social sciences to ground your research in interdisciplinary methods
Industry researchers – look towards trust & safety teams, policy teams, human rights teams and establish collaborations where research reflects real uses. Or advocate to (re)establish these teams.


Slide 2: Contribute to a crowdsourced, open source database: bit.ly/solar-cntr

Case studies are a great way for us to share, learn, teach, and reflect on methods together. 
At SoLaR 2025, we hope to crowdsource a spreadsheet of community-driven LLM evaluation case studies that orient us towards contextualized, participatory, expertise driven evaluations. 
Please add any your own or related work to our spreadsheet :) And let’s stay connected! We also have a tab for contact info!
michelleding.bsky.social
Thank you everyone who came to our talk! Here are some highlights:
Brown background and CNTR logo with white text that says:

Community-driven evaluations yield good scientific insights. 

They strip away marketing, hype, and unnecessary abstractions

And reveal the contexts where LLMs might genuinely add value and where they do not.

This opens space for meaningful dialogue between stakeholders across academia, industry, government, and civil society

And empowers the public to engage with and assess for themselves whether the tools they are being presented with can be used responsibly.
Reposted by Michelle L. Ding
valentinapy.bsky.social
💡We kicked off the SoLaR workshop at #COLM2025 with a great opinion talk by @michelleding.bsky.social & Jo Gasior Kavishe (joint work with @victorojewale.bsky.social and
@geomblog.bsky.social
) on "Testing LLMs in a sandbox isn't responsible. Focusing on community use and needs is."
Reposted by Michelle L. Ding
dippedrusk.com
Have you or a loved one been misgendered by an LLM? How can we evaluate LLMs for misgendering? Do different evaluation methods give consistent results?
Check out our preprint led by the newly minted Dr. @arjunsubgraph.bsky.social, and with Preethi Seshadri, Dietrich Klakow, Kai-Wei Chang, Yizhou Sun
Agree to Disagree? A Meta-Evaluation of LLM Misgendering
Numerous methods have been proposed to measure LLM misgendering, including probability-based evaluations (e.g., automatically with templatic sentences) and generation-based evaluations (e.g., with aut...
arxiv.org
michelleding.bsky.social
Hi #COLM2025! 🇨🇦 I will be presenting a talk on the importance of community-driven LLM evaluations based on an opinion abstract I wrote with Jo Kavishe, @victorojewale.bsky.social and @geomblog.bsky.social tomorrow at 9:30am in 524b for solar-colm.github.io

Hope to see you there!
Third Workshop on Socially Responsible Language Modelling Research (SoLaR) 2025
COLM 2025 in-person Workshop, October 10th at the Palais des Congrès in Montreal, Canada
solar-colm.github.io
Reposted by Michelle L. Ding
reniebird.bsky.social
I'll be presenting a position paper about consumer protection and AI in the US at ICML. I have a surprisingly optimistic take: our legal structures are stronger than I anticipated when I went to work on this issue in Congress.

Is everything broken rn? Yes. Will it stay broken? That's on us.
A poster for the paper "Position: Strong Consumer Protection is an Inalienable Defense for AI Safety in the United States"
Reposted by Michelle L. Ding
techpolicypress.bsky.social
With their 'Sovereignty as a Service' offerings, tech companies are encouraging the illusion of a race for sovereign control of AI while being the true powers behind the scenes, write Rui-Jie Yew, Kate Elizabeth Creasey, Suresh Venkatasubramanian.
'Sovereignty' Myth-Making in the AI Race | TechPolicy.Press
Tech companies stand to gain by encouraging the illusion of a race for 'sovereign' AI, write Rui-Jie Yew, Kate Elizabeth Creasey, Suresh Venkatasubramanian.
www.techpolicy.press
Reposted by Michelle L. Ding
victorojewale.bsky.social
Excited to present "Towards AI Accountability Infrastructure: Gaps and Opportunities in AI Audit Tooling" at #CHI2025 tomorrow(today)!

🗓 Tue, 29 Apr | 9:48–10:00 AM JST (Mon, 28 Apr | 8:48–9:00 PM ET)
📍 G401 (Pacifico North 4F)

📄 dl.acm.org/doi/10.1145/...
michelleding.bsky.social
all welcomes are good welcomes even with a 4 year delay 👍
michelleding.bsky.social
Also...super grateful & happy to be continuing my research journey at Brown in a CS PhD under @geomblog.bsky.social and @harinisuresh.bsky.social 🌱Many papers to come (?)
Reposted by Michelle L. Ding
harinisuresh.bsky.social
@michelleding.bsky.social has been doing amazing work laying out the complex landscape of "deepfake porn" and distilling the unique challenges in governing it. We hope this work informs future AI governance efforts to address the severe harms of this content - reach out to us to chat more!
michelleding.bsky.social
Any synthetic content governance framework or method that aims to target AIG-NCII of adults should also consider their applicability to the malicious technical ecosystem and the three limitations we point out above. More during our presentation at chi-staig.github.io!
STAIG @ CHI 2025
chi-staig.github.io
michelleding.bsky.social
3. Adult AIG-NCII specific methods often focus on red teaming general purpose image generation models eg Stable Diffusion. While this is also important, there is an erroneous assumption of a "trustworthy technology" and "malicious users" when in fact the tech itself here is maliciously designed
michelleding.bsky.social
2. AIG-NCII prevention methods often conflate child sexual abuse materials (CSAM) with NCII of adults. But methods for CSAM (that often work with law enforcement databases) won't be as effective for adults due to different legal protections
michelleding.bsky.social
1. Transparency methods are very common in synthetic content governance frameworks but they are not enough. DF porn is not the same as other eg. political deepfakes bc human detectable images are still harmful. In fact, many of the deepfake creators label images as "fake" to avoid accountability