Dialz: A Python Toolkit for Steering Vectors
ArXiv: arxiv.org/abs/2505.06262
Docs: cardiffnlp.github.io/dialz/
Repo: github.com/cardiffnlp/d...
A Python package to help you create, apply and visualise steering vectors for anything you want - from sycophancy to bias.
📅 Deadline: 6 June. Priority will be given to those who completed the EoI, but we have a few additional places available!
Check out the list of amazing speakers and find out more on our website www.cardiffnlpworkshop.org
Shifting Perspectives: Steering Vector Ensembles for Robust Bias Mitigation in LLMs
ArXiv: arxiv.org/abs/2503.05371
GitHub: github.com/groovychoons...
Extremely Unofficial Blog Post: zarasiddique.com/blog/shiftin...
📍 Cardiff (Wales, UK)
✨Free registration and some accommodation options!✨
ℹ️ For more information: www.cardiffnlpworkshop.org
📝 Join us by completing the expression of interest form: forms.gle/rY1YCDgcjFDt...
If you can’t make it, please share with others who may be interested!
🔗 www.404media.co/openai-furio...
It's a weekend of watching for me 🍿
Done with @zarasiddique.bsky.social, @hsuvas.bsky.social and @antypasd.bsky.social
Will follow up with favourite papers in blog post form soon!