Our results show that current unlearning methods for AI safety only obfuscate dangerous knowledge, just like standard safety training.
Here's what we found👇
If you can’t catch me during the week, stop by our poster on the weekend or join the presentation!
If you can’t catch me during the week, stop by our poster on the weekend or join the presentation!
Join my oral presentation on Saturday at 4:30 pm to learn more.
Join my oral presentation on Saturday at 4:30 pm to learn more.
Our results show that current unlearning methods for AI safety only obfuscate dangerous knowledge, just like standard safety training.
Here's what we found👇
Our results show that current unlearning methods for AI safety only obfuscate dangerous knowledge, just like standard safety training.
Here's what we found👇