Rémy Decoupes
banner
remydec.bsky.social
Rémy Decoupes
@remydec.bsky.social
GeoAI (NLP & LLM) and GIS for disaster response and crisis management at UMR TETIS, INRAE

https://remydecoupes.pages-forge.inrae.fr/website
Reposted by Rémy Decoupes
We loved this quick visual rundown of the Frontier Datacenters hub.

Thanks to Rowan Cheung for featuring our project!

www.youtube.com/shorts/szAW...
This map reveals some of the hidden facts behind AI data centers 👀 #trendingshorts #ai #research
Epoch AI, a nonprofit research group, is using satellite imagery and public records to track the rapid expansion of AI datacenters across the United States.B...
www.youtube.com
January 13, 2026 at 10:13 PM
Reposted by Rémy Decoupes
38 participants ont joué à un escape game pour sensibiliser aux bonnes pratiques de gestion et publication des données scientifiques. Une initiative de l'UMR TETIS et l'UMR Espace Dev avec le soutien de l’Appel à Projets Science Ouverte AgroParisTech.
📰 Lire l'article : https://tinyurl.com/33rhpvfw
January 13, 2026 at 4:30 PM
Reposted by Rémy Decoupes
Arabic has 422 million speakers, yet most AI systems treat it as an afterthought.

The Technology Institute of the UAE just released Falcon-H1-Arabic, the first Arabic language model built on hybrid Mamba-Transformer architecture. This isn't another scaled-up model with better Arabic..

(1/7)
January 10, 2026 at 7:17 AM
Really interesting AI Forensics report about how we search for information using AI chatbots. It raises yet another layer of concern.
aiforensics.org/work/governi...
From 'Googling' to 'Asking ChatGPT': Governing AI Search
We provide an overview of the most relevant changes and shifts between traditional search engines and AI search functionalities. This report also offers a framework for situating AI search in the curr...
aiforensics.org
January 7, 2026 at 9:28 AM
Reposted by Rémy Decoupes
Longitudinal assessment of research in GIScience domain shows a positive impact of reproducible research practices
https://doi.org/10.31223/X5RJ3W
December 3, 2025 at 11:00 AM
Reposted by Rémy Decoupes
Yesterday, an #OpenReview vulnerability led to the leak of reviewer identities of all the major academic AI conferences, including the ongoing #ICLR2026 conferences. #ICLRLeaks

This is both a huge disaster, and an opportunity to tackle the serious flaws of AI research.
eu.36kr.com/en/p/3572028...
Academic Circle in Uproar: ICLR Reviewers Reveal Identities, Low Scores Given by Friends
True Open Review: Unveiling Transparent and Authentic Evaluations
eu.36kr.com
November 28, 2025 at 10:31 AM
Reposted by Rémy Decoupes
This interesting week started with DeepSeek V3.2!

I just wrote up a technical tour of the predecessors and components that led up to this:

🔗 magazine.sebastianraschka.com/p/technical-...

- Multi-Head Latent Attention
- RLVR
- Sparse Attention
- Self-Verification
- GRPO Updates
A Technical Tour of the DeepSeek Models from V3 to V3.2
Understanding How DeepSeek's Flagship Open-Weight Models Evolved
magazine.sebastianraschka.com
December 3, 2025 at 2:51 PM
BERT-like models are still the most downloaded models on Hugging Face (45%), compared with decoder-only models (9%).
www.reddit.com/r/LlamaFarm/...
From the LlamaFarm community on Reddit: "We're in an LLM bubble, not an AI bubble" - Here's what's actually getting downloaded on HuggingFace and how you can start to really use AI.
Explore this post and more from the LlamaFarm community
www.reddit.com
December 1, 2025 at 9:45 AM
Reposted by Rémy Decoupes
25 years of humanity at its best. #Wikipedia25

Donate now ➡️ donate.wikipedia25.org
November 21, 2025 at 8:09 PM
Reposted by Rémy Decoupes
Les géants américains et chinois se disputent la suprématie dans le secteur de l'IA et cette révolution technologique soulève évidemment des enjeux stratégiques et éthiques 🤖
Intelligence artificielle : une compétition mondiale | Le dessous des cartes - ARTE
Les géants américains et chinois se disputent la suprématie dans le secteur de l'intelligence artificielle, avec des investissements colossaux. L'IA générative, qui produit textes, images et musiques, est au coeur de cette bataille. Cette révolution technologique soulève des enjeux stratégiques et é
www.youtube.com
November 21, 2025 at 9:00 AM
Reposted by Rémy Decoupes
Interested in developing LLMs that work for dialectal Arabic? Introducing the AMIYA shared task: Arabic Modeling In Your Accent, just accepted to VarDial 2026. Please consider submitting and joining us in Morocco if you do! sites.google.com/view/vardial...
VarDial 2026 - Shared Tasks
AMIYA (عامية) Shared Task: Arabic Modeling In Your Accent The AMIYA shared task will offer a chance for researchers to demonstrate innovations and improvements in language modeling of dialectal Arabic...
sites.google.com
November 12, 2025 at 3:54 PM
Reposted by Rémy Decoupes
The program for the 6th Spatial Data Science Symposium is now online. Registration will open soon. The symposium is Dec 4 & 5. Plan to check out the Thematic Sessions, Paper Sessions, Emerging Researchers Panel, Keynotes, and Interview. sdss2025.spatial-data-science.net/program.html
November 11, 2025 at 6:37 PM
Reposted by Rémy Decoupes
🌉 #EMNLP2026 will be October 24-29th in Budapest! 🌉

Thanks all for a great conference, and see you at the next one!
November 7, 2025 at 10:41 PM
Reposted by Rémy Decoupes
My new field guide to alternatives to standard LLMs:

Gated DeltaNet hybrids (Qwen3-Next, Kimi Linear), text diffusion, code world models, and small reasoning transformers.

🔗 magazine.sebastianraschka.com/p/beyond-sta...
November 4, 2025 at 2:49 PM
Reposted by Rémy Decoupes
Because of an onslaught of AI-generated research, specifically in the computer science (CS) section, arXiv is going to limit which papers can be published.

@mjgault.bsky.social has more:
November 3, 2025 at 5:00 PM
Reposted by Rémy Decoupes
Training LLMs end to end is hard. But way more people should, and will, be doing it in the future.

The @hf.co Research team is excited to share their new e-book that covers the full pipeline:
· pre-training,
· post-training,
· infra.

200+ pages of what worked and what didn’t. ⤵️
November 2, 2025 at 3:17 PM
Reposted by Rémy Decoupes
So we're hiring.
October 28, 2025 at 4:01 PM
Our latest paper has just been published. We explore geographical biases in language models and their implications (with a focus on crisis monitoring).
@interdonatos.bsky.social , @matroche.bsky.social , M.Teisseire & S.Valentin.

Code: github.com/tetis-nlp/ge...

link.springer.com/article/10.1...
Evaluation of geographical distortions in language models - Machine Learning
Geographic bias in language models (LMs) is an underexplored dimension of model fairness, despite growing attention being given to other social biases. We investigate whether LMs provide equally accur...
link.springer.com
October 28, 2025 at 1:21 PM
It's a very nice article in which the author compares the changes in the neural network architecture among the most famous 2025 LLMs: The Big LLM Architecture Comparison magazine.sebastianraschka.com/p/the-big-ll...
The Big LLM Architecture Comparison
From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Design
magazine.sebastianraschka.com
October 28, 2025 at 11:30 AM
Reposted by Rémy Decoupes
🤗 Sentence Transformers is joining @hf.co! 🤗

This formalizes the existing maintenance structure, as I've personally led the project for the past two years on behalf of Hugging Face. I'm super excited about the transfer!

Details in 🧵
October 22, 2025 at 1:04 PM
Reposted by Rémy Decoupes
Un projet à l’intersection de l’histoire, de la géomatique, des sciences du langage et de l’IA.

Les slides de ma présentation : docs.google.com/presentation...
Modèles et données : huggingface.co/GEODE
Démonstration : huggingface.co/spaces/GEODE...
2025 RnMSH Talk
L’usage de l’IA pour une étude interdisciplinaire de la géographie dans l’Encyclopédie de Diderot et d’Alembert Ludovic Moncla [email protected] Pratiques d’intelligence artificielle appliqu...
docs.google.com
October 16, 2025 at 4:59 PM
Reposted by Rémy Decoupes
🎥 Tools SIESTA | Anjana

🧩 A Python library that makes data anonymization simple & powerful with techniques like generalization, suppression & microaggregation.

👉 Watch: youtu.be/yw3tOd8WuIU

#EOSCSIESTA
Anonymizing Sensitive Tabular Data: A Practical Guide with Anjana
YouTube video by EOSC SIESTA
youtu.be
October 6, 2025 at 11:35 AM
Reposted by Rémy Decoupes
And new paper out: Pleias 1.0: the First Family of Language Models Trained on Fully Open Data

How we train an open everything model on a new pretraining environment with releasable data (Common Corpus) with an open source framework (Nanotron from HuggingFace).

www.sciencedirect.com/science/arti...
September 27, 2025 at 11:44 AM
Reposted by Rémy Decoupes
🗺️ Geoint, les yeux du renseignement

Au carrefour de la géographie, de l'intelligence artificielle et des sciences humaines, le Geoint (pour Geosptial Intelligence) façonne une nouvelle cartographie pour le militaire et le civil.

🎧 @franceculture.fr

www.radiofrance.fr/francecultur...
September 23, 2025 at 8:55 AM
Reposted by Rémy Decoupes
This analysis leaves out some big variables (poverty, diet, lack of access to healthcare, etc.), but it does make a convincing argument that actual death tolls from natural disasters are much higher than official numbers, especially as disasters become more frequent.

youtube.com/watch?v=LsTo...
Could Weather Explain Why People Live So Much Longer Outside of the Southeast US?
YouTube video by PBS Terra
youtube.com
September 19, 2025 at 4:22 AM