Check out our paper for the full methodology, human evaluation details, and comprehensive benchmarks.
What other legal NLP applications can we design using BriefMe? 🤔
- Realistic argument completion: Llama-3.1-70B finds missing arguments only 18% of the time
- Case retrieval: Best method finds correct precedents in top-5 results just 31.4% of the time
Lots of room for improvement! 📈
🤖 GPT-4o: 4.3/5 avg. LLM-as-judge rating for both arg. summ. & comp.
🤵 Lawyers: 4.0/5 (summ.) and 3.9/5 (comp.) avg. rating
LLMs excel at summarization and guided completion: their outputs need only minor edits.
🧩 This realistic version is especially challenging: models must spot gaps in the ToCs with no guidance.
- Argument summarization
- Realistic/guided argument completion: filling in missing arguments within the Table of Contents (ToC)
- Case retrieval
Each assesses different practical aspects of legal reasoning.
Most legal NLP work focuses on judicial opinions, but we target the attorney's perspective instead 🏛️
I look forward to further work on the parametric faithfulness route!
Codebase (& data): github.com/technion-cs-...