🔬 Expertise in:
- LLMs
- Optimization Problems
- Computer Vision
- Recommendation Systems
Here is the recording of the presentation:
www.youtube.com/watch?v=-gYn...
Here is the recording of the presentation:
www.youtube.com/watch?v=-gYn...
That's why we built skore – your companion when modeling with scikit-learn. Check it out and let us know what you think!
github.com/probabl-ai/s...
That's why we built skore – your companion when modeling with scikit-learn. Check it out and let us know what you think!
github.com/probabl-ai/s...
Option 1:
• Apple M4 Max
• 14-core CPU, 32-core GPU
• 36 GB unified memory
• 1 TB SSD
Option 2:
• Apple M4 Pro
• 14-core CPU, 20-core GPU
• 48 GB unified memory
• 1 TB SSD
Option 1:
• Apple M4 Max
• 14-core CPU, 32-core GPU
• 36 GB unified memory
• 1 TB SSD
Option 2:
• Apple M4 Pro
• 14-core CPU, 20-core GPU
• 48 GB unified memory
• 1 TB SSD
arxiv.org/abs/2409.09232
arxiv.org/abs/2409.09232
go.bsky.app/6HkrMcp
Hello @scikit-learn.bsky.social , @networkx.bsky.social , @scipyconf.bsky.social
go.bsky.app/6HkrMcp
Hello @scikit-learn.bsky.social , @networkx.bsky.social , @scipyconf.bsky.social
Using these, we got >6x speed-ups compared to the original CleanRL implementations.
github.com/pytorch-labs...
Using these, we got >6x speed-ups compared to the original CleanRL implementations.
github.com/pytorch-labs...
This is an excellent attempt (blog & paper) at bringing more statistical rigor to evaluation of ML models (this is specifically focused on LLM evals).
I feel like we need to have similar clear standards for many types of predictive models in biology. 1/
This is an excellent attempt (blog & paper) at bringing more statistical rigor to evaluation of ML models (this is specifically focused on LLM evals).
I feel like we need to have similar clear standards for many types of predictive models in biology. 1/
Including a discussion on why the original transformer architecture figure is wrong, and a related approach published in 1991!
https://magazine.sebastianraschka.com/p/why-the-original-transformer-figure
Including a discussion on why the original transformer architecture figure is wrong, and a related approach published in 1991!
https://magazine.sebastianraschka.com/p/why-the-original-transformer-figure
If you want to understand how the architectures look like under the hood, I implemented them from scratch (one of the best ways to learn): github.com/rasbt/LLMs-f...
If you want to understand how the architectures look like under the hood, I implemented them from scratch (one of the best ways to learn): github.com/rasbt/LLMs-f...