Natural Language Processing | Artificial Intelligence | Computational Linguistics | Human-centric NLP
🔎 263 languages, 10 similarity measures, 3 NLP tasks
👥 @verenablaschke.bsky.social Masha Fedzechkina @maartjeterhoeve.bsky.social
🔗 arxiv.org/abs/2501.14491
📁 Findings – long
🔎 263 languages, 10 similarity measures, 3 NLP tasks
👥 @verenablaschke.bsky.social Masha Fedzechkina @maartjeterhoeve.bsky.social
🔗 arxiv.org/abs/2501.14491
📁 Findings – long
🔎Analyzing how human-like LLMs are when taking reading, history, and economics tests
👥 @saeub.bsky.social , Diego Frassinelli, @barbaraplank.bsky.social
🔗 arxiv.org/abs/2506.09796
📁BEA workshop - Long
🔎Analyzing how human-like LLMs are when taking reading, history, and economics tests
👥 @saeub.bsky.social , Diego Frassinelli, @barbaraplank.bsky.social
🔗 arxiv.org/abs/2506.09796
📁BEA workshop - Long
🔎 We release a novel German anamnesis question-response dataset with human-simulated and LLM-augmented responses.
👥 @JHofenbitzer et al.
🔗 github.com/Jhofenbitzer...
📁SRW - Long
🔎 We release a novel German anamnesis question-response dataset with human-simulated and LLM-augmented responses.
👥 @JHofenbitzer et al.
🔗 github.com/Jhofenbitzer...
📁SRW - Long
🔎Do LLMs encode and generalize discourse knowledge across languages?
👥 @florian-eichin.com @janetlauyeung.bsky.social @mhedderich.bsky.social @barbaraplank.bsky.social
🔗 arxiv.org/abs/2503.10515
📁Main - Long
🔎Do LLMs encode and generalize discourse knowledge across languages?
👥 @florian-eichin.com @janetlauyeung.bsky.social @mhedderich.bsky.social @barbaraplank.bsky.social
🔗 arxiv.org/abs/2503.10515
📁Main - Long
🔎We present a large-scale study of whether LLM judgments can be reliably used as proxies for human judgments
👥Anna Bavaresco et al.
🔗 arxiv.org/abs/2406.18403
📁Main - Short
🔎We present a large-scale study of whether LLM judgments can be reliably used as proxies for human judgments
👥Anna Bavaresco et al.
🔗 arxiv.org/abs/2406.18403
📁Main - Short
👥 @mhedderich.bsky.social Anyi Wang @raoyuan.bsky.social @florian-eichin.com Jonas Fischer @barbaraplank.bsky.social
🔗 arxiv.org/abs/2504.158...
📁Main - Long
👥 @mhedderich.bsky.social Anyi Wang @raoyuan.bsky.social @florian-eichin.com Jonas Fischer @barbaraplank.bsky.social
🔗 arxiv.org/abs/2504.158...
📁Main - Long
👥 @beiduo.bsky.social Siyao Peng @annakorhonen.bsky.social @barbaraplank.bsky.social
🔗 arxiv.org/abs/2412.13942
📁ACL25 Findings-Long
👥 @beiduo.bsky.social Siyao Peng @annakorhonen.bsky.social @barbaraplank.bsky.social
🔗 arxiv.org/abs/2412.13942
📁ACL25 Findings-Long
🔎We study the relationship between circuits for highly compositional and functionally related tasks
👥@pmondorf.bsky.social Sondre Wold @barbaraplank.bsky.social
🔗 arxiv.org/abs/2410.01434
📁Main-Long
🔎We study the relationship between circuits for highly compositional and functionally related tasks
👥@pmondorf.bsky.social Sondre Wold @barbaraplank.bsky.social
🔗 arxiv.org/abs/2410.01434
📁Main-Long
🔎We review existing datasets for evaluating LLMs’ pragmatic capabilities, outlining key challenges and promising future directions
🔗 arxiv.org/abs/2502.12378
📁Main - Long
🔎We review existing datasets for evaluating LLMs’ pragmatic capabilities, outlining key challenges and promising future directions
🔗 arxiv.org/abs/2502.12378
📁Main - Long
🔎This study evaluates LLMs in generating German public opinions using open-ended survey data
🔗 arxiv.org/abs/2412.13169
📁Main - Long
🔎This study evaluates LLMs in generating German public opinions using open-ended survey data
🔗 arxiv.org/abs/2412.13169
📁Main - Long