Reposted by Alexander Kolesnikov
Alexander Kolesnikov
@kolesnikov.ch
· Dec 21
Knowledge distillation: A good teacher is patient and consistent
There is a growing discrepancy in computer vision between large-scale models that achieve state-of-the-art performance and models that are affordable in practical applications. In this paper we addres...
arxiv.org
Alexander Kolesnikov
@kolesnikov.ch
· Dec 20
Have you ever wondered how to train an autoregressive generative transformer on text and raw pixels, without a pretrained visual tokenizer (e.g. VQ-VAE)?
We pondered this over the summer and developed a new model: JetFormer 🌊🤖
arxiv.org/abs/2411.19722
A thread 👇
1/
Alexander Kolesnikov
@kolesnikov.ch
· Dec 20
Jet: A Modern Transformer-Based Normalizing Flow
In the past, normalizing generative flows have emerged as a promising class of generative models for natural images. This type of model has many modeling advantages: the ability to efficiently compute...
arxiv.org
Alexander Kolesnikov
@kolesnikov.ch
· Dec 5
🚀🚀PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes.
1/7
Reposted by Alexander Kolesnikov
Sander Dieleman
@sedielem.bsky.social
· Dec 2
Alexander Kolesnikov
@kolesnikov.ch
· Dec 2
2021: Replace every CNN with a Transformer
2022: Replace every GAN with diffusion models
2023: Replace every NeRF with 3DGS
2024: Replace every diffusion model with Flow Matching
2025: ???