Lightnews — Scholar-powered news

Jakub Łucki

@jakublucki.bsky.social

24 followers 38 following 11 posts

Visiting Researcher at NASA JPL | Data Science MSc at ETH Zurich

Posts Replies Media Videos

Pinned

Jakub Łucki @jakublucki.bsky.social · Dec 6

🚨Unlearned hazardous knowledge can be retrieved from LLMs 🚨

Our results show that current unlearning methods for AI safety only obfuscate dangerous knowledge, just like standard safety training.

Here's what we found👇

Jakub Łucki

@jakublucki.bsky.social

Just arrived in Vancouver for #NeurIPS! If you’d like to chat about cutting-edge research, let me know! I’ve always been curious about far too many things (for my own good), so all topics are welcome.

If you can’t catch me during the week, stop by our poster on the weekend or join the presentation!

December 10, 2024 at 1:58 AM

Jakub Łucki

@jakublucki.bsky.social

Our paper on how unlearning fails to remove hazardous knowledge from LLM weights received 🏆 Best Paper 🏆 award at SoLaR @ NeurIPS!

Join my oral presentation on Saturday at 4:30 pm to learn more.

December 6, 2024 at 5:58 PM

Jakub Łucki

@jakublucki.bsky.social

December 6, 2024 at 5:47 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news