https://fleuret.org
fleuret.org/dlc/
And my "Little Book of Deep Learning" is available as a phone-formatted pdf (nearing 700k downloads!)
fleuret.org/lbdl/
That was quite popular and here is a synthesis of the responses:
That was quite popular and here is a synthesis of the responses:
Paris:
Paris:
One the telepresence screen, a simulation of a 21c internet pundit pops up: "Told you deep learning was hitting a wall!"
Together with @danielepal.bsky.social , @matpagliardini.bsky.social, M. Jaggi and @francois.fleuret.org we show that LLMs have a smaller effective depth that can be exploited to increase inference speeds on multi-GPU settings!
arxiv.org/abs/2502.02790
(1/N)
Together with @danielepal.bsky.social , @matpagliardini.bsky.social, M. Jaggi and @francois.fleuret.org we show that LLMs have a smaller effective depth that can be exploited to increase inference speeds on multi-GPU settings!
arxiv.org/abs/2502.02790
(1/N)
www.rts.ch/play/tv/19h3...
www.rts.ch/play/tv/19h3...
pytorch.org/blog/flexatten…
TL;DR: it is an implementation of the attention operator in pytorch that allows in particular to efficiently "carve" the attention matrix.
1/3
pytorch.org/blog/flexatten…
TL;DR: it is an implementation of the attention operator in pytorch that allows in particular to efficiently "carve" the attention matrix.
1/3
2025 is certainly full of promise.
2025 is certainly full of promise.
And Half Life 3.
And Half Life 3.
1/2
1/2
tl;dr: We define an interpolating density by its sampling process, and learn the corresponding equilibrium potential with score matching. arxiv.org/abs/2410.15815
with @francois.fleuret.org and @tbereau.bsky.social
(1/n)
tl;dr: We define an interpolating density by its sampling process, and learn the corresponding equilibrium potential with score matching. arxiv.org/abs/2410.15815
with @francois.fleuret.org and @tbereau.bsky.social
(1/n)
- general ideas
- general case
- general case
- general case
- what we actually do
how it should be:
- what we actually do
- why we think it's great as one method of a general class
- how we got there
- how we got there
- how we got there
- general ideas
- general case
- general case
- general case
- what we actually do
how it should be:
- what we actually do
- why we think it's great as one method of a general class
- how we got there
- how we got there
- how we got there
www.youtube.com/watch?v=YH3c...
www.youtube.com/watch?v=YH3c...