Saffron Huang
@saffron.bsky.social
how shall we live together?

societal impacts researcher at Anthropic
saffronhuang.com
Reposted by Saffron Huang
In the latest essay in our AI & Democratic Freedoms series, @lujain.bsky.social, @saffron.bsky.social, @umangsbhatt.bsky.social, Lama Ahmad, and Markus Anderljung propose a new AI evaluation paradigm that assesses the harms that can emerge from repeated human-AI interactions.
Towards Interactive Evaluations for Interaction Harms in Human-AI Systems
knightcolumbia.org
June 23, 2025 at 5:51 PM
Reposted by Saffron Huang
“As AI advances, many are worried about the tech's potential to concentrate unprecedented wealth among a few, while eroding the economic value of human work for everyone else.”

Here’s how @saffron.bsky.social & Sam Manning believe we can stop that from happening.

#ai #wealthinequality #economy
Here’s How To Share AI’s Future Wealth | NOEMA
Advanced AI threatens to increase inequality and concentrate power, but we can proactively distribute AI’s benefits to foster a just and inclusive economy before it’s too late.
www.noemamag.com
April 22, 2025 at 5:55 PM
Really proud and excited to release work on empirically measuring AI values “in the wild” — understanding, analyzing and taxonomizing what values guide model outputs in real interactions with real users.

www.anthropic.com/research/val...
April 21, 2025 at 3:54 PM
This piece I wrote is now in the Stanford CS ethics curriculum! honestly, the exact kind of audience i wanted, so 🥹.

(Also I do actually still think this piece is ~my compass for what technology is and what it means to build it!)

www.kernelmag.io/1/what-is-te...
March 20, 2025 at 1:59 AM
Reviving my Substack to try to describe something that is very difficult to describe (a near death experience 20 years after it happened) saffron.substack.com/p/something-...
something divine shook me by the shoulders
when you see so clearly that everything on the outside is really on the inside
saffron.substack.com
January 26, 2025 at 5:11 PM
oh my god i love this one:

Q: Which philosopher/logician identified an inconsistency in the US Constitution? Einstein tried (and failed) to persuade him not to point this out during his US citizenship test.
A: Godel
Gonna restart the philosophy quiz started at other place – questions to be set at irregular intervals. Here were the first 20 questions.
January 17, 2025 at 8:23 PM
one of my favourite chinese words is 时光 which Google-translates as ‘time’ but really it’s ‘time-light’. as in, ‘the wonderful time-light we spent together’
January 14, 2025 at 4:19 PM
Reposted by Saffron Huang
The future is here, and it should be co-created.

These global dialogues convene thousands of people from around the world to set a vision - and concrete goals - for what world they want.

Our first dialogue centers around the fears, dreams, hopes, and attitudes people have about AI.
December 20, 2024 at 1:36 AM
Reposted by Saffron Huang
How are AI Assistants being used in the real world?

Our new research shows how to answer this question in a privacy preserving way, automatically identifying trends in Claude usage across the world.

1/
December 12, 2024 at 9:37 PM
Reposted by Saffron Huang
read the paper — there are some fun anecdotes! www.anthropic.com/research/clio
Clio is Anthropic's new system for identifying AI risks that it hadn't thought to look for — what it calls the unknown unknowns. I talked with the team that built it and share for the first time the top three ways people use Claude www.platformer.news/how-claude-u...
December 12, 2024 at 9:15 PM
SO excited to share Clio with the world (and on Bsky before Twitter)!

Clio generates insights on AI usage patterns, in a way that keeps user data private. It has unlocked, and will continue to unlock, an immense amount of understanding about the present and future of AI use.

(Blog linked below)
Clio is Anthropic's new system for identifying AI risks that it hadn't thought to look for — what it calls the unknown unknowns. I talked with the team that built it and share for the first time the top three ways people use Claude www.platformer.news/how-claude-u...
December 12, 2024 at 9:35 PM
great news: using AI to generate a silly little short film for my friend's birthday. it's so hard to get anything you actually want to happen, happen, consistency is terrible, and what it gives you is so diff from regular filmmaking

this overall makes me happy although i am currently suffering
December 5, 2024 at 5:05 AM
computer history museum highlights: punched-card distribution maps of plant and flower species on the british isles
November 30, 2024 at 10:15 PM
macbooks are cool, but they can’t print your name in ASCII art on an impact printer that outputs 600 lines per min and smells like an engine
November 30, 2024 at 10:08 PM
‘the light in San Francisco is like golden hour all day’ - friend
November 30, 2024 at 3:57 PM
learned a startling amount from this about crime/drugs/addiction & what happens when markets meet vices & vulnerability open.spotify.com/episode/2L2A...
The Hidden Politics of Disorder
The Ezra Klein Show · Episode
open.spotify.com
November 27, 2024 at 5:47 AM
Reposted by Saffron Huang
Suing Kendrick for having a superior diss track is something Kendrick would say Drake does in a diss track
November 25, 2024 at 10:22 PM
@divya.bsky.social and I contributed an essay to this volume on AI morality, edited by the inimitable @davidedmonds100.bsky.social (which delights me as a big Philosophy Bites fan)

it's out now & the contributors are incredible: see global.oup.com/academic/pro... or www.amazon.com/AI-Morality-...
November 24, 2024 at 9:32 PM
life / adaptation / multicellularity (from last night’s max cooper concert)
November 24, 2024 at 4:13 AM
Reposted by Saffron Huang
For my newsletter I wrote about some things I learned as a young activist that have just stayed true over the last 35 years. I’ve thought of a few more, will put those in the next one.
November 16, 2024 at 12:54 PM
Reposted by Saffron Huang
Um yes please to this starter pack go.bsky.app/2JYffeG
November 16, 2024 at 1:04 PM
going to japan… thinking about encountering japanese design in just 14 hours makes me squeal excitedly like a child
June 9, 2023 at 11:17 AM
individuals are networks of groups just as groups are networks of individuals
May 6, 2023 at 10:07 AM
boys love to jump around on the dance floor and collide into each other like bumper cars
May 1, 2023 at 11:25 PM