Lightnews — Scholar-powered news

Reposted by Rémy Decoupes

Epoch AI

@epochai.bsky.social

We loved this quick visual rundown of the Frontier Datacenters hub.

Thanks to Rowan Cheung for featuring our project!

www.youtube.com/shorts/szAW...

This map reveals some of the hidden facts behind AI data centers 👀 #trendingshorts #ai #research

Epoch AI, a nonprofit research group, is using satellite imagery and public records to track the rapid expansion of AI datacenters across the United States.B...

www.youtube.com

January 13, 2026 at 10:13 PM

Reposted by Rémy Decoupes

AgroParisTech

@agroparistech.fr

38 participants ont joué à un escape game pour sensibiliser aux bonnes pratiques de gestion et publication des données scientifiques. Une initiative de l'UMR TETIS et l'UMR Espace Dev avec le soutien de l’Appel à Projets Science Ouverte AgroParisTech.
📰 Lire l'article : https://tinyurl.com/33rhpvfw

January 13, 2026 at 4:30 PM

Reposted by Rémy Decoupes

Me AI

@me-ai.bsky.social

Arabic has 422 million speakers, yet most AI systems treat it as an afterthought.

The Technology Institute of the UAE just released Falcon-H1-Arabic, the first Arabic language model built on hybrid Mamba-Transformer architecture. This isn't another scaled-up model with better Arabic..

(1/7)

January 10, 2026 at 7:17 AM

Rémy Decoupes

@remydec.bsky.social

Really interesting AI Forensics report about how we search for information using AI chatbots. It raises yet another layer of concern.
aiforensics.org/work/governi...

From 'Googling' to 'Asking ChatGPT': Governing AI Search

We provide an overview of the most relevant changes and shifts between traditional search engines and AI search functionalities. This report also offers a framework for situating AI search in the curr...

aiforensics.org

January 7, 2026 at 9:28 AM

Reposted by Rémy Decoupes

EarthArXiv Bot

@eartharxivbot.bsky.social

Longitudinal assessment of research in GIScience domain shows a positive impact of reproducible research practices
https://doi.org/10.31223/X5RJ3W

December 3, 2025 at 11:00 AM

Reposted by Rémy Decoupes

Lê Nguyên Hoang

@science4all.org

Yesterday, an #OpenReview vulnerability led to the leak of reviewer identities of all the major academic AI conferences, including the ongoing #ICLR2026 conferences. #ICLRLeaks

This is both a huge disaster, and an opportunity to tackle the serious flaws of AI research.
eu.36kr.com/en/p/3572028...

Academic Circle in Uproar: ICLR Reviewers Reveal Identities, Low Scores Given by Friends

True Open Review: Unveiling Transparent and Authentic Evaluations

eu.36kr.com

November 28, 2025 at 10:31 AM

Reposted by Rémy Decoupes

Sebastian Raschka (rasbt)

@sebastianraschka.com

This interesting week started with DeepSeek V3.2!

I just wrote up a technical tour of the predecessors and components that led up to this:

🔗 magazine.sebastianraschka.com/p/technical-...

- Multi-Head Latent Attention
- RLVR
- Sparse Attention
- Self-Verification
- GRPO Updates

A Technical Tour of the DeepSeek Models from V3 to V3.2

Understanding How DeepSeek's Flagship Open-Weight Models Evolved

magazine.sebastianraschka.com

December 3, 2025 at 2:51 PM

Rémy Decoupes

@remydec.bsky.social

BERT-like models are still the most downloaded models on Hugging Face (45%), compared with decoder-only models (9%).
www.reddit.com/r/LlamaFarm/...

From the LlamaFarm community on Reddit: "We're in an LLM bubble, not an AI bubble" - Here's what's actually getting downloaded on HuggingFace and how you can start to really use AI.

Explore this post and more from the LlamaFarm community

www.reddit.com

December 1, 2025 at 9:45 AM

Reposted by Rémy Decoupes

Wikipedia

@wikipedia.org

25 years of humanity at its best. #Wikipedia25

Donate now ➡️ donate.wikipedia25.org

November 21, 2025 at 8:09 PM

Reposted by Rémy Decoupes

ARTE

@artefr.bsky.social

Les géants américains et chinois se disputent la suprématie dans le secteur de l'IA et cette révolution technologique soulève évidemment des enjeux stratégiques et éthiques 🤖

Intelligence artificielle : une compétition mondiale | Le dessous des cartes - ARTE

Les géants américains et chinois se disputent la suprématie dans le secteur de l'intelligence artificielle, avec des investissements colossaux. L'IA générative, qui produit textes, images et musiques, est au coeur de cette bataille. Cette révolution technologique soulève des enjeux stratégiques et é

www.youtube.com

November 21, 2025 at 9:00 AM

Reposted by Rémy Decoupes

n8rob.bsky.social

@n8rob.bsky.social

Interested in developing LLMs that work for dialectal Arabic? Introducing the AMIYA shared task: Arabic Modeling In Your Accent, just accepted to VarDial 2026. Please consider submitting and joining us in Morocco if you do! sites.google.com/view/vardial...

VarDial 2026 - Shared Tasks

AMIYA (عامية) Shared Task: Arabic Modeling In Your Accent The AMIYA shared task will offer a chance for researchers to demonstrate innovations and improvements in language modeling of dialectal Arabic...

sites.google.com

November 12, 2025 at 3:54 PM

Reposted by Rémy Decoupes

Platial Analysis Lab

@platialanalysis.bsky.social

The program for the 6th Spatial Data Science Symposium is now online. Registration will open soon. The symposium is Dec 4 & 5. Plan to check out the Thematic Sessions, Paper Sessions, Emerging Researchers Panel, Keynotes, and Interview. sdss2025.spatial-data-science.net/program.html

November 11, 2025 at 6:37 PM

Reposted by Rémy Decoupes

EMNLP

@emnlpmeeting.bsky.social

🌉 #EMNLP2026 will be October 24-29th in Budapest! 🌉

Thanks all for a great conference, and see you at the next one!

An image of a conference presentation slide showing that EMNLP 2026 will be held October 24-29th in Budapest, with an audience below

November 7, 2025 at 10:41 PM

Reposted by Rémy Decoupes

Sebastian Raschka (rasbt)

@sebastianraschka.com

My new field guide to alternatives to standard LLMs:

Gated DeltaNet hybrids (Qwen3-Next, Kimi Linear), text diffusion, code world models, and small reasoning transformers.

🔗 magazine.sebastianraschka.com/p/beyond-sta...

November 4, 2025 at 2:49 PM

Reposted by Rémy Decoupes

404 Media

@404media.co

Because of an onslaught of AI-generated research, specifically in the computer science (CS) section, arXiv is going to limit which papers can be published.

@mjgault.bsky.social has more:

November 3, 2025 at 5:00 PM

Reposted by Rémy Decoupes

Julien Chaumond

@julien-c.hf.co

Training LLMs end to end is hard. But way more people should, and will, be doing it in the future.

The @hf.co Research team is excited to share their new e-book that covers the full pipeline:
· pre-training,
· post-training,
· infra.

200+ pages of what worked and what didn’t. ⤵️

November 2, 2025 at 3:17 PM

Reposted by Rémy Decoupes

Alexander Doria

@dorialexander.bsky.social

So we're hiring.

October 28, 2025 at 4:01 PM

Rémy Decoupes

@remydec.bsky.social

Our latest paper has just been published. We explore geographical biases in language models and their implications (with a focus on crisis monitoring).
@interdonatos.bsky.social , @matroche.bsky.social , M.Teisseire & S.Valentin.

Code: github.com/tetis-nlp/ge...

link.springer.com/article/10.1...

Evaluation of geographical distortions in language models - Machine Learning

Geographic bias in language models (LMs) is an underexplored dimension of model fairness, despite growing attention being given to other social biases. We investigate whether LMs provide equally accur...

link.springer.com

October 28, 2025 at 1:21 PM

Rémy Decoupes

@remydec.bsky.social

It's a very nice article in which the author compares the changes in the neural network architecture among the most famous 2025 LLMs: The Big LLM Architecture Comparison magazine.sebastianraschka.com/p/the-big-ll...

The Big LLM Architecture Comparison

From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Design

magazine.sebastianraschka.com

October 28, 2025 at 11:30 AM

Reposted by Rémy Decoupes

Tom Aarsen

@tomaarsen.com

🤗 Sentence Transformers is joining @hf.co! 🤗

This formalizes the existing maintenance structure, as I've personally led the project for the past two years on behalf of Hugging Face. I'm super excited about the transfer!

Details in 🧵

October 22, 2025 at 1:04 PM

Reposted by Rémy Decoupes

Ludovic Moncla

@ludovicmoncla.bsky.social

Un projet à l’intersection de l’histoire, de la géomatique, des sciences du langage et de l’IA.

Les slides de ma présentation : docs.google.com/presentation...
Modèles et données : huggingface.co/GEODE
Démonstration : huggingface.co/spaces/GEODE...

2025 RnMSH Talk

L’usage de l’IA pour une étude interdisciplinaire de la géographie dans l’Encyclopédie de Diderot et d’Alembert Ludovic Moncla [email protected] Pratiques d’intelligence artificielle appliqu...

docs.google.com

October 16, 2025 at 4:59 PM

Reposted by Rémy Decoupes

EOSC SIESTA

@eosc-siesta.eu

🎥 Tools SIESTA | Anjana

🧩 A Python library that makes data anonymization simple & powerful with techniques like generalization, suppression & microaggregation.

👉 Watch: youtu.be/yw3tOd8WuIU

#EOSCSIESTA

Anonymizing Sensitive Tabular Data: A Practical Guide with Anjana

YouTube video by EOSC SIESTA

youtu.be

October 6, 2025 at 11:35 AM

Reposted by Rémy Decoupes

Alexander Doria

@dorialexander.bsky.social

And new paper out: Pleias 1.0: the First Family of Language Models Trained on Fully Open Data

How we train an open everything model on a new pretraining environment with releasable data (Common Corpus) with an open source framework (Nanotron from HuggingFace).

www.sciencedirect.com/science/arti...

September 27, 2025 at 11:44 AM

Reposted by Rémy Decoupes

Cultures Monde

@culturesmonde.bsky.social

🗺️ Geoint, les yeux du renseignement

Au carrefour de la géographie, de l'intelligence artificielle et des sciences humaines, le Geoint (pour Geosptial Intelligence) façonne une nouvelle cartographie pour le militaire et le civil.

🎧 @franceculture.fr

www.radiofrance.fr/francecultur...

Un homme portant un masque contre la covid-19 présente un drone permettant de faire du Geoint dans un salon spécialisé à Saint-Louis, aux États-Unis, en 2021.

Crédits : UPI/MAXPPP

September 23, 2025 at 8:55 AM

Reposted by Rémy Decoupes

Lawrence Culver

@lawrencecphd.bsky.social

This analysis leaves out some big variables (poverty, diet, lack of access to healthcare, etc.), but it does make a convincing argument that actual death tolls from natural disasters are much higher than official numbers, especially as disasters become more frequent.

youtube.com/watch?v=LsTo...

Could Weather Explain Why People Live So Much Longer Outside of the Southeast US?

YouTube video by PBS Terra

youtube.com

September 19, 2025 at 4:22 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news