📜 Learning is compression
https://rupeshks.cc/
by @rupspace.bsky.social
Cool blog post "in defense" of weighted variants of ResNets (aka HighwayNets) - as a follow up to a previous post by @giffmana.ai.
rupeshks.cc/blog/skip.html
by @rupspace.bsky.social
Cool blog post "in defense" of weighted variants of ResNets (aka HighwayNets) - as a follow up to a previous post by @giffmana.ai.
rupeshks.cc/blog/skip.html
rupeshks.cc/blog/skip.html
rupeshks.cc/blog/skip.html
chromewebstore.google.com/detail/sky-f...
chromewebstore.google.com/detail/sky-f...
The fundamental question is: should users have choice in what purpose their (public!) posts are used for?
@bsky.app needs to think through what their answer is. (1/3)
The fundamental question is: should users have choice in what purpose their (public!) posts are used for?
@bsky.app needs to think through what their answer is. (1/3)
-NeurIPS2024 Communication Chairs
-NeurIPS2024 Communication Chairs
Look at image generation models. We’re so far from compute optimality yet pre-training is by far the most important bit.
Diffusion models are like o1: train for 1 step but unroll for many. Even test time inference is pre-training bottlenecked.
A key part of the case Tesla has been making about their approach (vs Waymo) is that they can bring the cost down by a lot and scale up production/access because they don't use LIDAR.
A key part of the case Tesla has been making about their approach (vs Waymo) is that they can bring the cost down by a lot and scale up production/access because they don't use LIDAR.
Note that the deadline is Dec 2nd!
x.com/j_foerst/sta...
Note that the deadline is Dec 2nd!
x.com/j_foerst/sta...