recently, @primeintellect.bsky.social announced that they finished their 10B distributed training run, trained across the world.
what is it exactly?
🧵
arxiv.org/abs/2407.036...
✨🙌 Amazing work, @_akhaliq!!
🔗 github.com/AK391/gemini...
Well, we've taken a look and found serious issues in this paper, and shown, once again, that structured generation *improves* evaluation performance!
It's currently at the top of the Chatbot Arena. I've updated my llm-gemini plugin to support it and used that to run my pelican-on-a-bicycle SVG benchmark.
My notes: simonwillison.net/2024/Nov/22/...
uv run https://gist.githubusercontent.com/simonw/848a3b91169a789bc084a459aa7ecf83/raw/44fe7e0b326832e88beb83748b50104e5e7f70d0/follow_theirs.py
gist.github.com/simonw/848a3...
=> docs.google.com/presentation...
Model: huggingface.co/jinaai/jina-...
📈 Jina-CLIP-v2 outperforms Jina-CLIP-v1 (by 3% on text-image and text-text tasks)
🧵