"Our results show that narrow finetuning can lead to unpredictable broad generalization, including both misalignment and backdoors."
"Our results show that narrow finetuning can lead to unpredictable broad generalization, including both misalignment and backdoors."
Covers concepts, finetuning, evaluations.
#ArtificialIntelligence #MachineLearning #DeepLearning #DataScience #Analytics
Covers concepts, finetuning, evaluations.
#ArtificialIntelligence #MachineLearning #DeepLearning #DataScience #Analytics
#AgenticRAG #Chroma #Finetuning
doyouknow.kr/622/rag-adva...
#AgenticRAG #Chroma #Finetuning
doyouknow.kr/622/rag-adva...
recently people were jimmying claude to get training data back out and they got a "soul doc"
recently people were jimmying claude to get training data back out and they got a "soul doc"
but you actually just fabricate what's unlocked so you're feeding them a false narrative
Currently, it is way to easy to just let the mech blow itself up using its own missiles. But it is on theme for a ramshackle goblin mech 🤔
#gamedev #indiedev #gamemaker
Currently, it is way to easy to just let the mech blow itself up using its own missiles. But it is on theme for a ramshackle goblin mech 🤔
#gamedev #indiedev #gamemaker
- complex sampling techniques – e.g., beam-searching for diverse strings in token sequence space dl.acm.org/doi/full/10....
- finetuning – e.g., "forcing diffuse distributions" arxiv.org/abs/2404.10859; our recent COLM work arxiv.org/abs/2503.17126
- complex sampling techniques – e.g., beam-searching for diverse strings in token sequence space dl.acm.org/doi/full/10....
- finetuning – e.g., "forcing diffuse distributions" arxiv.org/abs/2404.10859; our recent COLM work arxiv.org/abs/2503.17126
when will it dawn that LLMs are fundamentally vulnerable to endless new attacks like this, and therefore fundamentally unsafe?
https://arxiv.org/html/2511.15304v1
when will it dawn that LLMs are fundamentally vulnerable to endless new attacks like this, and therefore fundamentally unsafe?
1. Another challenge is lack of serious engagement from AI researchers with humanities scholars.
2. This new work makes it sound like it just takes some finetuning to get big improvements.
bsky.app/profile/tuhi...
Authors have sued LLM companies for using books w/o permission for model training.
Courts however need empirical evidence of market harm. Our preregistered study exactly addresses this gap.
Joint work w Jane Ginsburg from Columbia Law and @dhillonp.bsky.social 1/n🧵
1. Another challenge is lack of serious engagement from AI researchers with humanities scholars.
2. This new work makes it sound like it just takes some finetuning to get big improvements.
bsky.app/profile/tuhi...
…☕️📱🐅 & Alltag hier …
…& die letzten 2 Wochen Wohnzimmer streichen & Schränke ausräumen, umstellen, einräumen & nun dann hier alles, auch Space Homeoffice / Esstisch, aufgeräumt ✅
…Kleinigkeiten Haushalt ✅
…& noch Finetuning Haus IT 🛜✅
…☕️📱🐅 & Alltag hier …
…& die letzten 2 Wochen Wohnzimmer streichen & Schränke ausräumen, umstellen, einräumen & nun dann hier alles, auch Space Homeoffice / Esstisch, aufgeräumt ✅
…Kleinigkeiten Haushalt ✅
…& noch Finetuning Haus IT 🛜✅