Lightnews — Scholar-powered news

Reposted by Aryo Pradipta Gema

Emile van Krieken

@emilevankrieken.com

We propose Neurosymbolic Diffusion Models! We find diffusion is especially compelling for neurosymbolic approaches, combining powerful multimodal understanding with symbolic reasoning 🚀

Read more 👇

May 21, 2025 at 10:57 AM

Aryo Pradipta Gema

@aryopg.bsky.social

MMLU-Redux just touched down at #NAACL2025! 🎉
Wish I could be there for our "Are We Done with MMLU?" poster today (9:00-10:30am in Hall 3, Poster Session 7), but visa drama said nope 😅
If anyone's swinging by, give our research some love! Hit me up if you check it out! 👋

May 2, 2025 at 1:00 PM

Reposted by Aryo Pradipta Gema

Benjamin Minixhofer

@bminixhofer.bsky.social

We created Approximate Likelihood Matching, a principled (and very effective) method for *cross-tokenizer distillation*!

With ALM, you can create ensembles of models from different families, convert existing subword-level models to byte-level and a bunch more🧵

Image illustrating that ALM can enable Ensembling, Transfer to Bytes, and general Cross-Tokenizer Distillation.

April 2, 2025 at 6:36 AM

Reposted by Aryo Pradipta Gema

Alex Murphy

@alexandermurphy.bsky.social

🚀 Thrilled to share our new preprint, "An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering"! 📄

Dive into the paper: arxiv.org/abs/2503.23415

#AI #MachineLearning #LLM #NLP #Research #QuestionAnswering #Retrieval

April 1, 2025 at 1:56 PM

Aryo Pradipta Gema

@aryopg.bsky.social

Today, I'm starting as an AI Safety Fellow @anthropic.com ! 🚀
Super excited to collaborate and learn from some of the brightest minds in AI! 🌟

March 24, 2025 at 8:51 AM

Reposted by Aryo Pradipta Gema

Pasquale Minervini

@neuralnoise.com

madly in love with this article from @thesun.co.uk covering a paper from @rohit-saxena.bsky.social and @aryopg.bsky.social

Paper: arxiv.org/abs/2502.05092
The Sun: www.thesun.co.uk/tech/3384555...

March 17, 2025 at 12:41 PM

Reposted by Aryo Pradipta Gema

Rohit Saxena

@rohit-saxena.bsky.social

Can multimodal LLMs truly understand research poster images?📊

🚀 We introduce PosterSum—a new multimodal benchmark for scientific poster summarization!

📂 Dataset: huggingface.co/datasets/rohitsaxena/PosterSum
📜 Paper: arxiv.org/abs/2502.17540

March 10, 2025 at 2:19 PM

Reposted by Aryo Pradipta Gema

Pasquale Minervini

@neuralnoise.com

Garbage in, garbage out -- nice gem for the Italian-speaking folks on this platform 😅 TLDR, in arxiv.org/abs/2406.04127 we found that MMLU contains TONS of errors, and looks like all these seamlessly propagated to this new "Global MMLU" dataset

December 6, 2024 at 1:17 PM

Aryo Pradipta Gema

@aryopg.bsky.social

Super Cool work from Cohere for AI! 🎉 However, this highlights a concern raised by our MMLU-Redux team (arxiv.org/abs/2406.04127): **error propagation to many languages**. Issues in MMLU (e.g., "rapid intervention to solve ebola") seem to persist in many languages. Let's solve the root cause first?

Sara Hooker @sarahooker.bsky.social · Dec 5

Is MMLU Western-centric? 🤔

As part of a massive cross-institutional collaboration:
🗽Find MMLU is heavily overfit to western culture
🔍 Professional annotation of cultural sensitivity data
🌍 Release improved Global-MMLU 42 languages

📜 Paper: arxiv.org/pdf/2412.03304
📂 Data: hf.co/datasets/Coh...

December 6, 2024 at 9:38 AM

Reposted by Aryo Pradipta Gema

Pasquale Minervini

@neuralnoise.com

For clarity -- great project, but most of the MMLU errors we found (and fixed) in our MMLU Redux paper (arxiv.org/abs/2406.04127) are also present in this dataset. We also provide a curated version of MMLU, so it's easy to fix 😊

Daniel Vila @dvilasuero.hf.co · Dec 6

Announcing Global-MMLU - an improved MMLU Open dataset with evaluation coverage across 42 languages.

The result of months of work with the goal of advancing Multilingual LLM evaluation.

Built together with the community and amazing collaborators at Cohere4AI, MILA, MIT, and many more.

December 6, 2024 at 9:26 AM

Reposted by Aryo Pradipta Gema

Cyril Zakka, MD

@cyrilzakka.bsky.social

Super excited to introduce Halo: A beginner's guide to DIY health tracking with wearables! 🤗✨
Using an $11 smart ring, I'll show you how to build your own private health monitoring app. From basic metrics to advanced features like:
- Activity tracking
- HR monitoring
- Sleep analysis
and more!

A picture showing Halo's features which include heart rate, sleep cycle and SPO2 monitoring, using on-device ML.

November 19, 2024 at 6:59 PM

Reposted by Aryo Pradipta Gema

Pasquale Minervini

@neuralnoise.com

Starter pack for University of Edinburgh researchers done by the amazing ramandutt4.bsky.social - go.bsky.app/KRNDkN7

University of Edinburgh Starter Pack

Join the conversation

go.bsky.app

November 20, 2024 at 4:39 PM

Reposted by Aryo Pradipta Gema

Sweta Karlekar

@swetakar.bsky.social

If you’re interested in mechanistic interpretability, I just found this starter pack and wanted to boost it (thanks for creating it @butanium.bsky.social !). Excited to have a mech interp community on bluesky 🎉

go.bsky.app/LisK3CP

November 19, 2024 at 12:28 AM

Reposted by Aryo Pradipta Gema

Pasquale Minervini

@neuralnoise.com

Joining the Generative AI Lab (GAIL, gail.ed.ac.uk) at the University of Edinburgh as a GAIL Fellow! Excited for what's ahead 🤗

Generative AI Laboratory

gail.ed.ac.uk

November 19, 2024 at 11:17 PM

Reposted by Aryo Pradipta Gema

Giwon Hong

@giwonhong.bsky.social

🤔How to achieve efficient ICL without storing a huge dataset in one prompt?
💡Mixtures of In-Context Learners (𝗠𝗼𝗜𝗖𝗟): we treat LLMs prompted with subsets of demonstrations as experts and learn a weighting function to optimise the distribution over the continuation (🧵1/n)

November 18, 2024 at 6:36 PM

Reposted by Aryo Pradipta Gema

Maria Antoniak

@mariaa.bsky.social

Started making a list of researchers working at the intersection of healthcare, language, and computation. Please help me add more people!

November 18, 2024 at 11:09 AM

Aryo Pradipta Gema

@aryopg.bsky.social

I’ll be travelling to London from Wednesday to Friday for an upcoming event and would be very happy to meet up! 🚀
I'd love to chat about my recent works (DeCoRe, MMLU-Redux, etc.). DM me if you’re around! 👋

DeCoRe: arxiv.org/abs/2410.18860
MMLU-Redux: arxiv.org/abs/2406.04127

November 18, 2024 at 1:48 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news