https://www.madelonhulsebos.com
Really enjoyed this reflection exercise 🙂
Really enjoyed this reflection exercise 🙂
With talks by Marine Le Morvan (Inria), Floris Geerts (University of Antwerp) and @akhtarmubashara.bsky.social (ETH Zurich), and more TBA.
We hope to meet you there!!
With talks by Marine Le Morvan (Inria), Floris Geerts (University of Antwerp) and @akhtarmubashara.bsky.social (ETH Zurich), and more TBA.
We hope to meet you there!!
Bottomline: LLMs aren't robust for real-world multi-table QA
Bottomline: LLMs aren't robust for real-world multi-table QA
"How well do LLMs reason over tabular data, really?" 📊
We dig into two important questions:
1️⃣ Are general-purpose LLMs robust with real-world tables?
2️⃣ How should we actually evaluate them? (2/4)
"How well do LLMs reason over tabular data, really?" 📊
We dig into two important questions:
1️⃣ Are general-purpose LLMs robust with real-world tables?
2️⃣ How should we actually evaluate them? (2/4)