website: hayoungjung.me
🏦 Location: Hall C
⏲️Time: 11AM-12:30PM
🔗 Paper: aclanthology.org/2025.emnlp-m...
📁 Repo: github.com/hayoungjungg...
🏦 Location: Hall C
⏲️Time: 11AM-12:30PM
🔗 Paper: aclanthology.org/2025.emnlp-m...
📁 Repo: github.com/hayoungjungg...
➡️12.7% of recs from myth videos led to more myths initially—rising to 22% at deeper levels.
⚠️ Moderation should target these rec pathways that reinforce harmful myths.
➡️12.7% of recs from myth videos led to more myths initially—rising to 22% at deeper levels.
⚠️ Moderation should target these rec pathways that reinforce harmful myths.
😬A few clicks can change your exposure to myths!
😬A few clicks can change your exposure to myths!
📊 Finding #1: Nearly 20% of YouTube search results support OUD myths, while 30% oppose.
😰Despite more opposing, myth-supporting content is widespread—and risks shaping how people understand treatment.
📊 Finding #1: Nearly 20% of YouTube search results support OUD myths, while 30% oppose.
😰Despite more opposing, myth-supporting content is widespread—and risks shaping how people understand treatment.
📊 Achieves 0.68-0.86 macro F1 and defers only 5-67% of the examples to the costly LLM.
In practice, MythTriage:
💸 Cuts financial costs by 98% vs experts and by 94% vs LLM labeling
⏱️ Cuts time costs by 96% vs experts & by 76% vs LLM labeling
📊 Achieves 0.68-0.86 macro F1 and defers only 5-67% of the examples to the costly LLM.
In practice, MythTriage:
💸 Cuts financial costs by 98% vs experts and by 94% vs LLM labeling
⏱️ Cuts time costs by 96% vs experts & by 76% vs LLM labeling
👉 Uses lightweight DeBERTa for routine cases
👉 Defers harder ones to GPT-4o (high-performing but costly)
The trick? We distilled DeBERTa on GPT-4o’s synthetic labels—achieving strong performance without massive expert-labeled data.
👉 Uses lightweight DeBERTa for routine cases
👉 Defers harder ones to GPT-4o (high-performing but costly)
The trick? We distilled DeBERTa on GPT-4o’s synthetic labels—achieving strong performance without massive expert-labeled data.
✅Validate eight pervasive myths on OUD (see examples below!)
✅Create and refine annotation guidelines
✅Build a gold-standard dataset: 310 videos labeled across 8 myths (~2.5K expert labels).
✅Validate eight pervasive myths on OUD (see examples below!)
✅Create and refine annotation guidelines
✅Build a gold-standard dataset: 310 videos labeled across 8 myths (~2.5K expert labels).
1️⃣ OUD Search Dataset: 2.9K search results
2️⃣ OUD Recs Dataset: 343K video recommendations
1️⃣ OUD Search Dataset: 2.9K search results
2️⃣ OUD Recs Dataset: 343K video recommendations
To understand the scale of such misinformation, our #EMNLP2025 paper introduces MythTriage, a scalable system to detect OUD myth🧵
To understand the scale of such misinformation, our #EMNLP2025 paper introduces MythTriage, a scalable system to detect OUD myth🧵
Since most users will likely engage with SERPs in this default settings, users in SA 🇿🇦 face a higher likelihood of misinformation exposure, raising public health risks.
Since most users will likely engage with SERPs in this default settings, users in SA 🇿🇦 face a higher likelihood of misinformation exposure, raising public health risks.
📡 5G Claims
💉 Vaccine Content Claims
💻 Bill Gates Claims
Effect sizes suggest 🇿🇦 users are at greater risk of encountering misinformative search results than 🇺🇸 users.
📡 5G Claims
💉 Vaccine Content Claims
💻 Bill Gates Claims
Effect sizes suggest 🇿🇦 users are at greater risk of encountering misinformative search results than 🇺🇸 users.
In our #icwsm '25 paper w/ @prerna6.bsky.social @tanumitra.bsky.social, we found bots in SA received significantly more misinfo in top-10 search results, which accounts for 95% of user traffic
In our #icwsm '25 paper w/ @prerna6.bsky.social @tanumitra.bsky.social, we found bots in SA received significantly more misinfo in top-10 search results, which accounts for 95% of user traffic