How we trained an open-everything model in a new pretraining environment, using releasable data (Common Corpus) and an open-source framework (Nanotron from HuggingFace).
www.sciencedirect.com/science/arti...
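A minimal sketch of what "pretraining on releasable data" can look like in practice: streaming Common Corpus from the Hugging Face Hub with the `datasets` library. This is not the authors' actual pipeline; the dataset id "PleIAs/common_corpus" and the "text" field name are assumptions.

```python
# Hypothetical sketch: stream Common Corpus documents for pretraining.
# Streaming avoids downloading the full multi-terabyte corpus up front.
from datasets import load_dataset

ds = load_dataset("PleIAs/common_corpus", split="train", streaming=True)

# Peek at a few documents before feeding them to a tokenizer / trainer.
for example in ds.take(3):
    print(example["text"][:200])
```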
(we did a relatively good job at personality tuning).
We trained 3 models - 1.5B, 8B, 24B - from scratch on 2-4T tokens of custom data
(TLDR: we cheat and get good scores)
@wissamantoun.bsky.social @rachelbawden.bsky.social @bensagot.bsky.social @zehavoc.bsky.social