The Integrity Institute is committed to tackling the negative impacts of the social internet. Discover our achievements and key influences over the past year, plus our community of people dedicated to making the internet a safer place!
Read here: bit.ly/42T0yOF
II member #AdityaJain shows why safety efforts must start in pre-training. If large language models (LLMs) learn harmful behaviors early, post-training fixes like SFT and RLHF can be insufficient.
Read more here 🔗 bit.ly/4sUtPn8
It features chapters on prompt engineering & AI agents, plus a practical guide to how GenAI works and how to manage its risks. bit.ly/4qtICnf
'Show Me the Data' is a must-read resource on data transparency and platform accountability, especially as new requirements like DSA Article 40 take effect.
bit.ly/45dv0El
Alison Lee, @vaishnavi.bsky.social, & Ariel C. discussed how tech companies can better protect young users - covering red-teaming, AI chat risks, & safety by design.
🔗 lnkd.in/ehF3ZsFg
It's a must-read for anyone interested in understanding how recommender systems work and how they can be improved.
bit.ly/3NgQZnG
This guide is a critical resource that offers practical steps for reducing systemic online harms and building safer, more responsible digital spaces.
🔗 Read the full report here bit.ly/3N9IqLr
Working in online safety can feel isolating, and there aren't enough spaces where builders can come together and reach for the limits of what's possible. This is what OSS is all about!
Proud to help highlight online risks and support initiatives that make the internet safer for women, girls, and all users.
bit.ly/4a9sJxc
The conversation explores how the field has evolved and what the future will require from platforms.
Listen here: bit.ly/48ycWXq
We're grateful to @cdteu.org and all the participants for fostering such a productive conversation. 🤝🌐
Civil society organisations, academics, policymakers and regulators joined us for a full day focused on Year Two of the Digital Services Act.
It was an energising and thoughtful exchange 👇
If this mission resonates, reach out — and share with others building a healthier digital world.
👉 Link here bit.ly/47PPEvV
Don't miss an opportunity to hear from Trust & Safety expert Nicholas Shen, currently leading T&S for Tools for Humanity, about the impact of technology on American democracy.
📅 Friday, 11/14 at 12:00 PM Eastern Time
▶️ RSVP here: www.linkedin.com/feed/update/...
Ilamosi is a regulatory & tech policy expert with a decade of experience shaping digital governance, AI policy, & regulatory strategy in Africa & globally. At Flutterwave she manages global public policy & ESG framework development.
#TechPolicy #AI #ESG #Meta
Recent AI chatbot incidents show youth are at risk online. Join experts Vaishnavi J, Alison Lee, and Ariel Colon as they share practical strategies for safer AI, including red-teaming, long-chat risks, and safety-by-design.
Register here: bit.ly/47mmZ1f
Check out the recording now on our YouTube channel!
🎦 www.youtube.com/watch?v=7gOK...
Join @vaishnavi.bsky.social, Alison Lee of the Rithm Project, and Ariel Colon of apgard ai to discuss practical approaches companies can take to address risks to young people!
Nov. 4 at 3pm ET
Register here: bit.ly/47sApaz
In August, our co-founder @jeffallen.bsky.social reflected on the evolution of Trust & Safety, and why regulatory pressure—and transparency—still matter most.
▶️ Read the full conversation on the @socialcohesiontech.bsky.social Substack: bit.ly/4neJfi5
Aditya works at the intersection of AI safety and recommender systems, building trustworthy LLMs and agents. He was previously an ML lead at Meta and a founding engineer at Google Area 120.
Aditya is also a featured conference speaker + reviewer (NeurIPS, AAAI). 👏
Don't miss this insightful conversation about the Trump Administration's New AI Action Plan with policy experts from the Integrity Institute.
Event link here: bit.ly/47gB0xH
There is an acute mental health crisis among teens. As T&S workers at platforms that develop or employ AI chatbots, we must understand how these two trends intersect.
Read our research review and recommendations here: bit.ly/3GDbSGE
II members Abhi Chaudhuri, Dominique Wimmer, Jenna Dietz, mattmotyl.bsky.social, & David Jay explore how Trust & Safety teams address online child abuse. We’re proud to spotlight the people doing the hard work to keep kids safe online.
👉