Andrew Vaziri
@4threv.com
AI, Robotics & Society
There will still be niches where open-source models won't have the specialized data to compete, but you can't copyright math or logic. The fundamental capability to reason will continue to be actively developed. (8/8)
January 28, 2025 at 7:52 AM
In conclusion, DeepSeek's advancements are newsworthy, but the market didn't seem to know how to interpret their impact. The sky isn't falling just because open-source models are competitive. (7/8)
January 28, 2025 at 7:48 AM
I'm not minimizing DeepSeek's achievement, but this isn't a totally unprecedented result. Distilling larger models into smaller, more practical ones was a widely expected development. (6/8)
January 28, 2025 at 7:47 AM
2. The authors claim it was very important to train R-1 on the outputs of larger models, a process called distillation. This is literally the first thing you'd try when shrinking a model while preserving performance. (5/8)
January 28, 2025 at 7:46 AM
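A minimal sketch of the distillation idea from (5/8), assuming the classic soft-label setup (Hinton et al., 2015) rather than DeepSeek's actual pipeline, which fine-tunes smaller models on text generated by the larger one; the model sizes and data below are toy placeholders:

```python
# Soft-label knowledge distillation: a small "student" net learns to
# match the softened output distribution of a frozen "teacher" net.
import torch
import torch.nn as nn
import torch.nn.functional as F

teacher = nn.Sequential(nn.Linear(32, 256), nn.ReLU(), nn.Linear(256, 10))
student = nn.Sequential(nn.Linear(32, 16), nn.ReLU(), nn.Linear(16, 10))
teacher.eval()  # the teacher is frozen; only the student is trained

optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)
T = 2.0  # temperature: softens the teacher's distribution

for step in range(100):
    x = torch.randn(64, 32)  # stand-in for real training inputs
    with torch.no_grad():
        teacher_logits = teacher(x)
    student_logits = student(x)
    # KL divergence between softened distributions; the T**2 factor
    # keeps gradient magnitudes comparable across temperatures.
    loss = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T ** 2)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```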
Models that don’t specialize in reasoning, but instead focus on writing style or broad knowledge, will still be needed. These models are more resource-intensive to train than R-1. (4/8)
January 28, 2025 at 7:46 AM
1. R-1 is competing in a specialized class of AI models focused on chain-of-thought reasoning. Some of the reinforcement-learning tricks used to make it train efficiently only work in domains with a definitive correct answer, like math or formal logic. (3/8)
January 28, 2025 at 7:45 AM
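To make (3/8) concrete, here is a minimal sketch of the kind of rule-based, verifiable reward that reinforcement learning can rely on in domains with a definitive correct answer; the function names and "Answer:" format are illustrative assumptions, not taken from the R-1 paper:

```python
# A program can grade each completion exactly, so the RL reward is
# unambiguous. There is no equivalent check for, say, elegant prose,
# which is why this trick is limited to math, logic, and the like.
import re
from typing import Optional

def extract_answer(completion: str) -> Optional[str]:
    """Pull the final answer from a completion expected to end with a
    line like 'Answer: 42'."""
    match = re.search(r"Answer:\s*(-?\d+(?:\.\d+)?)", completion)
    return match.group(1) if match else None

def reward(completion: str, ground_truth: str) -> float:
    """Binary reward: 1.0 for a verifiably correct answer, else 0.0."""
    return 1.0 if extract_answer(completion) == ground_truth else 0.0

# Grading two candidate chains of reasoning for "17 + 25":
print(reward("17 + 25 = 42, so Answer: 42", "42"))           # 1.0
print(reward("I think it's about forty. Answer: 40", "42"))  # 0.0
```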
R-1 is newsworthy, don't get me wrong, but here are a few things to keep in mind when assessing how much it should change your worldview: (2/8)
January 28, 2025 at 7:45 AM