We’re serious! Economic coordination happens via emails. How do humans fare against AIs in getting things done with words?
We see a genre co-emerging with LLMs: communication games, where communication is crucial and not just “cheap talk” like Mafia or Diplomacy.
We’re serious! Economic coordination happens via emails. How do humans fare against AIs in getting things done with words?
We see a genre co-emerging with LLMs: communication games, where communication is crucial and not just “cheap talk” like Mafia or Diplomacy.
Big thanks to @chenhaotan.bsky.social for advice on the project, as well as helpful feedback from the wonderful members of the @chicagohai.bsky.social lab! Check out our code at github.com/ChicagoHAI/l....
DM me for any questions!
Big thanks to @chenhaotan.bsky.social for advice on the project, as well as helpful feedback from the wonderful members of the @chicagohai.bsky.social lab! Check out our code at github.com/ChicagoHAI/l....
DM me for any questions!
So strangely, changing the prompt can change how a model represents race. Thus, in some cases, the model’s representation may be sensitive to spurious prompt features, which poses a challenge to the generalizability of debiasing methods. Future work on debiasing should take this into account.
So strangely, changing the prompt can change how a model represents race. Thus, in some cases, the model’s representation may be sensitive to spurious prompt features, which poses a challenge to the generalizability of debiasing methods. Future work on debiasing should take this into account.
We found the race subspace generalizes cross-family (from admissions to hiring) and, to a lesser extent, cross-explicitness (from implicit race via name to explicit race), but it fails to generalize cross-prompt (from one prompt template to another).
We found the race subspace generalizes cross-family (from admissions to hiring) and, to a lesser extent, cross-explicitness (from implicit race via name to explicit race), but it fails to generalize cross-prompt (from one prompt template to another).