Lightnews — Scholar-powered news

Gabriel Chua

@gabrielchua.bsky.social

13 followers 19 following 8 posts

Machine Learning at GovTech

gabrielchua.me

Posts Replies Media Videos

Gabriel Chua

@gabrielchua.bsky.social

We’ve open-sourced our 2 classifiers & the dataset (almost 50M tokens)

These classifier are:
- fast ⚡
- accurate & give well-calibrated probabilities ⚖️ (so that we can have differentiated responses)
- zero-shot 🔎 (i.e., teams can use this out of the box)

huggingface.co/collections/...

November 27, 2024 at 12:57 AM

Gabriel Chua

@gabrielchua.bsky.social

This approach works surprisingly well, and we apply it to the "off-topic" prompt detection.

The goal is to classify whether a user-prompt is irrelevant with respect to the system prompt. 🎯

November 27, 2024 at 12:57 AM

Gabriel Chua

@gabrielchua.bsky.social

Here, we explore a data-free guardrail development methodology leveraging LLMs to guard LLMs.

November 27, 2024 at 12:57 AM

Gabriel Chua

@gabrielchua.bsky.social

🚨 new applied ai paper from govtech

LLMs are powerful, but they're prone to off-topic misuse, where users push them beyond their intended scope. Think harmful prompts, jailbreaks, and misuse. So how do we build better guardrails?

arxiv.org/abs/2411.12946

November 27, 2024 at 12:57 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news