catherine 🌀
banner
catherinebrewer.bsky.social
catherine 🌀
@catherinebrewer.bsky.social
ai governance @openphil, unsupervised learner
i now have an ideas/questions/hare-brained schemes slack channel — if we've spoken, feel free to DM if you'd like to be added :)
January 11, 2025 at 11:44 AM
one of my most strongly held and least consequential beliefs is that knightian uncertainty just isn't a real thing
December 17, 2024 at 8:50 AM
Reposted by catherine 🌀
If a model card reports the results of dangerous capabilities evals and doesn't specify that they were conducted with safeguards removed, you should assume that safeguards were in place for testing.

Short 🧵 👇
December 4, 2024 at 10:28 AM
so who is making huelnog for this festive season ?
November 29, 2024 at 1:19 PM
i think RAND's report on securing ai model weights is great, i also think that if you read the whole thing you should get a little sticker or a pin badge or something
November 28, 2024 at 2:25 PM
i want to like bluesky but it doesn't quite have that disgusting eating pennies/cutting your tongue/looking too long at the sun feel that the other place has
November 20, 2024 at 2:33 PM