Lightnews — Scholar-powered news

Sara Rosenthal

@seirasto.bsky.social

1.3K followers 340 following 18 posts

NLP Research Scientist at IBM Research

Posts Replies Media Videos

Sara Rosenthal

@seirasto.bsky.social

Retrievers (Elser shown here) struggle with later turns and non-standalone questions:

January 8, 2025 at 8:10 PM

Sara Rosenthal

@seirasto.bsky.social

SOTA LLMs struggle with later turns and unanswerable questions:

January 8, 2025 at 8:09 PM

Sara Rosenthal

@seirasto.bsky.social

Sample Conversation:

January 8, 2025 at 8:09 PM

Sara Rosenthal

@seirasto.bsky.social

MTRAG is a challenging benchmark for SOTA LLMs and a great way to evaluate across multiple domains for Retrieval and Generation! MTRAG contains 110 conversations averaging 7.7 turns each across four domains for a total of 842 tasks. We also explore synthetic data and LLM-as-a-judge.

January 8, 2025 at 8:09 PM

Sara Rosenthal

@seirasto.bsky.social

Please just message me on slack

November 25, 2024 at 1:01 PM

Sara Rosenthal

@seirasto.bsky.social

Please add me. Thanks!

November 24, 2024 at 2:33 PM

Sara Rosenthal

@seirasto.bsky.social

Please add me!

November 19, 2024 at 2:44 AM

Sara Rosenthal

@seirasto.bsky.social

This is great! Please add me as well!

November 19, 2024 at 2:42 AM

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news