Lightnews — Scholar-powered news

Fateme Hashemi Chaleshtori

@fatemehc.bsky.social

340 followers 230 following 10 posts

PhD student at Utah NLP, Mechanistic Interpretability, Trustworthy AI, Human-centered AI

Posts Replies Media Videos

Fateme Hashemi Chaleshtori

@fatemehc.bsky.social

9/ We hope BriefMe encourages more Legal NLP development that directly aids legal professionals!
Check out our paper for the full methodology, human evaluation details, and comprehensive benchmarks.

What other legal NLP applications can we design using BriefMe? 🤔

June 20, 2025 at 10:07 PM

Fateme Hashemi Chaleshtori

@fatemehc.bsky.social

8/ ⚖️ BriefMe extends Legal NLP by introducing a dataset of legal briefs, a type of legal document that hasn't been overlooked before. We've designed tasks that attorneys actually need in their daily work, opening up new research directions to be explored to assist professionals.

June 20, 2025 at 10:07 PM

Fateme Hashemi Chaleshtori

@fatemehc.bsky.social

7/ However, LLMs struggle with these complex tasks:
- Realistic argument completion: Llama-3.1-70B finds missing arguments only 18% of the time
- Case retrieval: Best method finds correct precedents in top-5 results just 31.4% of the time

Lots of room for improvement! 📈

June 20, 2025 at 10:07 PM

Fateme Hashemi Chaleshtori

@fatemehc.bsky.social

6/ Surprising finding: GPT-4o outperforms human-written headings!
🤖 GPT-4o: 4.3/5 avg. LLM-as-judge rating for both arg. summ. & comp.
🤵 Lawyers: 4.0/5 (summ.) and 3.9/5 (comp.) avg. rating
LLMs excel at summarization and guided completion tasks, requiring only minor edits.

June 20, 2025 at 10:07 PM

Fateme Hashemi Chaleshtori

@fatemehc.bsky.social

5/ Evaluating generated text is challenging: traditional metrics (BLEU/ROUGE/...) are not aligned with human preferences. Instead, we built an LLM-as-judge using o3-mini, instructed with expert-written guidelines for brief headings, proving more reliable than human raters!

June 20, 2025 at 10:07 PM

Fateme Hashemi Chaleshtori

@fatemehc.bsky.social

4/ Our novel argument completion task tests if LLMs can identify WHERE exactly a missing argument should go in a brief's logical flow and WHAT that argument should be.
🧩 This realistic version is especially challenging: models must spot gaps in the ToCs with no guidance.

June 20, 2025 at 10:07 PM

Fateme Hashemi Chaleshtori

@fatemehc.bsky.social

3/ We built BriefMe from Supreme Court briefs with 3 key tasks:
- Argument summarization
- Realistic/Guided Argument completion: filling in missing arguments within the Table of Contents (ToC)
- Case retrieval
Each assesses different practical aspects of legal reasoning.

June 20, 2025 at 10:07 PM

Fateme Hashemi Chaleshtori

@fatemehc.bsky.social

2/ Legal briefs are documents where attorneys present their arguments to judges, making the case for their client's position by interpreting the law and citing relevant precedents.
Most legal NLP work focuses on judicial opinions, but we target the attorney's perspective instead 🏛️

June 20, 2025 at 10:07 PM

Reposted by Fateme Hashemi Chaleshtori

Martin Tutek

@mtutek.bsky.social

It has been amazing to work with @fatemehc.bsky.social, @anamarasovic.bsky.social and Yonatan Belinkov on this incredibly important topic.

I look forward to further works on the parametric faithfulness route!

Codebase (& data): github.com/technion-cs-...

GitHub - technion-cs-nlp/parametric-faithfulness

Contribute to technion-cs-nlp/parametric-faithfulness development by creating an account on GitHub.

github.com

February 21, 2025 at 12:43 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news