Without solid data, we risk both hype and harm. Thread 👇
Describes how to mature eval so systems can be worthy of trust and safely deployed.
1) Meaningful metrics: evaluation metrics must connect to AI system behaviour or impact that is relevant in the real world. They can be abstract or simplified, but they need to correspond to real-world performance or outcomes in a meaningful way.
My research interests include optimization, federated learning, machine learning, privacy, and unlearning.
1/n
digital-strategy.ec.europa.eu/en/library/s...
You jailbreak every model on the market😱😱😱
Fire work led by @jplhughes.bsky.social
Sara Price @aengusl.bsky.social Mrinank Sharma
Ethan Perez
arxiv.org/abs/2412.03556
Join our expert panellists* for a timely discussion on “Rethinking fairness in the era of large language models”!!
* @jessicaschrouff.bsky.social, @sethlazar.org, Sanmi Koyejo, Hoda Heidari
Led by @poonpura.bsky.social , who is applying for PhD programs this year 🚀
w/ @poonpura.bsky.social , Wei-Ning Chen, Sanmi Koyejo, Albert No
📄: arxiv.org/abs/2411.14639
🙌: big thank you to my collaborators and mentors Wei-Ning Chen, @berivanisik.bsky.social, Sanmi Koyejo, Albert No
🧵 16/16