https://remydecoupes.pages-forge.inrae.fr/website
Thanks to Rowan Cheung for featuring our project!
www.youtube.com/shorts/szAW...
Thanks to Rowan Cheung for featuring our project!
www.youtube.com/shorts/szAW...
📰 Lire l'article : https://tinyurl.com/33rhpvfw
📰 Lire l'article : https://tinyurl.com/33rhpvfw
The Technology Institute of the UAE just released Falcon-H1-Arabic, the first Arabic language model built on hybrid Mamba-Transformer architecture. This isn't another scaled-up model with better Arabic..
(1/7)
The Technology Institute of the UAE just released Falcon-H1-Arabic, the first Arabic language model built on hybrid Mamba-Transformer architecture. This isn't another scaled-up model with better Arabic..
(1/7)
aiforensics.org/work/governi...
aiforensics.org/work/governi...
https://doi.org/10.31223/X5RJ3W
https://doi.org/10.31223/X5RJ3W
This is both a huge disaster, and an opportunity to tackle the serious flaws of AI research.
eu.36kr.com/en/p/3572028...
This is both a huge disaster, and an opportunity to tackle the serious flaws of AI research.
eu.36kr.com/en/p/3572028...
I just wrote up a technical tour of the predecessors and components that led up to this:
🔗 magazine.sebastianraschka.com/p/technical-...
- Multi-Head Latent Attention
- RLVR
- Sparse Attention
- Self-Verification
- GRPO Updates
I just wrote up a technical tour of the predecessors and components that led up to this:
🔗 magazine.sebastianraschka.com/p/technical-...
- Multi-Head Latent Attention
- RLVR
- Sparse Attention
- Self-Verification
- GRPO Updates
www.reddit.com/r/LlamaFarm/...
www.reddit.com/r/LlamaFarm/...
Thanks all for a great conference, and see you at the next one!
Thanks all for a great conference, and see you at the next one!
Gated DeltaNet hybrids (Qwen3-Next, Kimi Linear), text diffusion, code world models, and small reasoning transformers.
🔗 magazine.sebastianraschka.com/p/beyond-sta...
Gated DeltaNet hybrids (Qwen3-Next, Kimi Linear), text diffusion, code world models, and small reasoning transformers.
🔗 magazine.sebastianraschka.com/p/beyond-sta...
@mjgault.bsky.social has more:
@mjgault.bsky.social has more:
The @hf.co Research team is excited to share their new e-book that covers the full pipeline:
· pre-training,
· post-training,
· infra.
200+ pages of what worked and what didn’t. ⤵️
The @hf.co Research team is excited to share their new e-book that covers the full pipeline:
· pre-training,
· post-training,
· infra.
200+ pages of what worked and what didn’t. ⤵️
@interdonatos.bsky.social , @matroche.bsky.social , M.Teisseire & S.Valentin.
Code: github.com/tetis-nlp/ge...
link.springer.com/article/10.1...
@interdonatos.bsky.social , @matroche.bsky.social , M.Teisseire & S.Valentin.
Code: github.com/tetis-nlp/ge...
link.springer.com/article/10.1...
This formalizes the existing maintenance structure, as I've personally led the project for the past two years on behalf of Hugging Face. I'm super excited about the transfer!
Details in 🧵
This formalizes the existing maintenance structure, as I've personally led the project for the past two years on behalf of Hugging Face. I'm super excited about the transfer!
Details in 🧵
Les slides de ma présentation : docs.google.com/presentation...
Modèles et données : huggingface.co/GEODE
Démonstration : huggingface.co/spaces/GEODE...
Les slides de ma présentation : docs.google.com/presentation...
Modèles et données : huggingface.co/GEODE
Démonstration : huggingface.co/spaces/GEODE...
🧩 A Python library that makes data anonymization simple & powerful with techniques like generalization, suppression & microaggregation.
👉 Watch: youtu.be/yw3tOd8WuIU
#EOSCSIESTA
🧩 A Python library that makes data anonymization simple & powerful with techniques like generalization, suppression & microaggregation.
👉 Watch: youtu.be/yw3tOd8WuIU
#EOSCSIESTA
How we train an open everything model on a new pretraining environment with releasable data (Common Corpus) with an open source framework (Nanotron from HuggingFace).
www.sciencedirect.com/science/arti...
How we train an open everything model on a new pretraining environment with releasable data (Common Corpus) with an open source framework (Nanotron from HuggingFace).
www.sciencedirect.com/science/arti...
Au carrefour de la géographie, de l'intelligence artificielle et des sciences humaines, le Geoint (pour Geosptial Intelligence) façonne une nouvelle cartographie pour le militaire et le civil.
🎧 @franceculture.fr
www.radiofrance.fr/francecultur...
Au carrefour de la géographie, de l'intelligence artificielle et des sciences humaines, le Geoint (pour Geosptial Intelligence) façonne une nouvelle cartographie pour le militaire et le civil.
🎧 @franceculture.fr
www.radiofrance.fr/francecultur...
youtube.com/watch?v=LsTo...
youtube.com/watch?v=LsTo...