Clemens Marschner
cmarschner.bsky.social
Clemens Marschner
@cmarschner.bsky.social
Autonomous Robots @ rob.co 🦾🇪🇺

Private: @cmarschnerde.bsky.social
Reposted by Clemens Marschner
DeepSeek has released JanusFlow model.

Model: huggingface.co/deepseek-ai/...
January 27, 2025 at 5:59 PM
Reposted by Clemens Marschner
Foundations of Large Language Models by Tong Xiao, Jingbo Zhu

This is a book (231 pages) about large language models. It primarily focuses on foundational concepts rather than comprehensive coverage of all technologies. The book is structured into four main chapters, each exploring a key area:
January 18, 2025 at 1:25 AM
Reposted by Clemens Marschner
Happy New Year everyone! Jim and I just put up our January 2025 release of Speech and Language Processing! Check it out here: web.stanford.edu/~jurafsky/sl...
Speech and Language Processing
Speech and Language Processing
web.stanford.edu
January 12, 2025 at 8:44 PM
If you‘re in ML, consider robotics at this point. Especially if you‘re in Europe.

There are amazing challenges in the space of spatial intelligence, planning, understanding of the physical world, control to be solved with AI.

And if you want to turn it into products, contact me.
January 11, 2025 at 2:56 PM
Just switching over from X for today.
Is there still sanity on this platform at least?
January 7, 2025 at 10:35 PM
Going back and forth between 1 week and 1 year, 2 year, 5 year timelines and loving it!
There is no such thing as an architecture role. Every IC writes code, and that's how it should be.
It's just that more senior engineers should think strategically and shape where a company will be in the future.
November 30, 2024 at 8:49 AM
On site, testing the #RobCo vision system 🦾🤖
November 25, 2024 at 9:42 PM
Reposted by Clemens Marschner
I've spent the last two years scouring all available resources on RLHF specifically and post training broadly. Today, with the help of a totally cracked team, we bring you the fruits of that labor — Tülu 3, an entirely open frontier model post training recipe. We beat Llama 3.1 Instruct.

Thread.
November 21, 2024 at 5:01 PM
I‘m turning this into my job account and will focus on robots and neural networks here. For architecture and city planning, see @cmarschnerde.bsky.social - like on X
November 19, 2024 at 10:14 PM
Reposted by Clemens Marschner
I recently gave a tutorial on the DUSt3R paper (web: dust3r.europe.naverlabs.com, paper: tinyurl.com/5t2ks575, code: github.com/naver/dust3r) in a research group meeting. In case you missed it, didn’t understand it or would like to hear some perspectives on why it’s such a cool idea, read on… 1/23
DUSt3R: Geometric 3D Vision Made Easy
dust3r.europe.naverlabs.com
November 18, 2024 at 11:18 PM
With all those starter packs and the Xodus, ML Bluesky now feels like Twitter 2016. Finally, content
November 19, 2024 at 9:45 PM
Resilience is the art of keeping things working in the light of problems.
It requires slack. Slack is the enemy of efficiency.
A society that has gone too far optimizing for efficiency will constantly be on the verge of collapse
March 25, 2024 at 10:23 PM