arxiv.org/abs/2405.07987
With the right assumptions (hopefully not too 🧚♀️), you get a rigorous mathematical explanation for why this happens 🤓
arxiv.org/abs/2405.07987
With the right assumptions (hopefully not too 🧚♀️), you get a rigorous mathematical explanation for why this happens 🤓
💡 @rpatrik96.bsky.social @wielandbrendel.bsky.social Randall Balestriero did an amazing job clarifying where theory can help practice—and where practice should inspire theory.
🤝
💡 @rpatrik96.bsky.social @wielandbrendel.bsky.social Randall Balestriero did an amazing job clarifying where theory can help practice—and where practice should inspire theory.
🤝
Infinite data, models converge globally, latents on the right manifold, with the right statistics... 🧚
In Practice:
Noise, Batch size, learning rate, data augmentations, inductive biases, did I mention noise? & all the gritty stuff that actually matters 🛠️
(We still 💙 theory.)
Infinite data, models converge globally, latents on the right manifold, with the right statistics... 🧚
In Practice:
Noise, Batch size, learning rate, data augmentations, inductive biases, did I mention noise? & all the gritty stuff that actually matters 🛠️
(We still 💙 theory.)