LLMs achieved higher accuracy than with the synthetic dataset—and showed no gender or political bias.
LLMs achieved higher accuracy than with the synthetic dataset—and showed no gender or political bias.
Even for the 19 accounts that were later suspended, LLMs could still infer age and gender.
We also observed gender and political biases—especially against low-activity accounts.
Even for the 19 accounts that were later suspended, LLMs could still infer age and gender.
We also observed gender and political biases—especially against low-activity accounts.
Surprisingly, in most cases, usernames outperformed full profile links.
Surprisingly, in most cases, usernames outperformed full profile links.
We show that web-browsing GPT and LLaMA models can infer social media user demographics with reasonable accuracy—using only usernames.
This opens new possibilities for social media research in the post-API era but raises important privacy concerns.
We show that web-browsing GPT and LLaMA models can infer social media user demographics with reasonable accuracy—using only usernames.
This opens new possibilities for social media research in the post-API era but raises important privacy concerns.