Lightnews — Scholar-powered news

hal

@harold.bsky.social

we also open source all of our code, data, and embeddings!
paper: arxiv.org/abs/2511.09685
github: github.com/htried/wiki-...
huggingface: huggingface.co/datasets/htr...

November 17, 2025 at 4:11 PM

hal

@harold.bsky.social

this is just the tip of the iceberg, and the paper contains much, much more: analyses of the top 100 domains, article subsets of elected officials and controversial topics, etc etc etc

please give it a read and let me know what you think!

comparison of the top 100 most-cited sources on wikipedia and grokipedia

comparison of article snippets from the "controversial articles" subset with high and low similarity

comparison of article snippets from the "elected officials" subset with high and low similarity

November 17, 2025 at 4:11 PM

hal

@harold.bsky.social

we also found troubling instances of “auto-citogenesis,” or cases where:
- an X user asks the Grok chatbot something, then publishes the answer
- Grokipedia *cites that answer* without noting that it is a chatbot output
(the attached images are real examples of this)

grok conversation trying to "dig up some dirt on Guy Verhofstadt"

Grok conversation about covid conspiracy theories

Grok conversation where the user asks "what race do you hate" and "benefits of a racist society"

Grok conversation about "what ethnicity runs global banking"

November 17, 2025 at 4:11 PM

hal

@harold.bsky.social

- but a random sample of articles shows which topics have been heavily rewritten (history, politics, philosophy, biography) and which haven’t (STEM, sports, movies)
- grokipedia also targeted the wiki articles deemed highest quality for rewrites: the "featured article" and "good article" classes

similarity between grokipedia and wikipedia articles by topic for 30k randomly selected articles

similarity between grokipedia and wikipedia articles by article quality class for 30k randomly selected articles

November 17, 2025 at 4:11 PM

hal

@harold.bsky.social

- the primary distinction to make is whether grokipedia pages are cc-licensed or not—non-cc-licensed pages are presumably largely rewritten by grok
- many grokipedia pages (including those without cc licenses) are basically identical to their wiki counterparts, especially short ones

graphs showing average article similarity for cc-licensed and non-cc-licensed grokipedia articles to their counterparts of wikipedia, as well as position-based chunk similarity

November 17, 2025 at 4:11 PM
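
The similarity comparisons in the post above can be illustrated with a toy example. This is only a sketch: it assumes cosine similarity over embedding vectors and a chunk-by-position pairing, which may not match the paper's actual method, and the function names and vectors are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def positional_chunk_similarity(chunks_a, chunks_b):
    """Compare the i-th chunk of one article with the i-th chunk of the other.

    Assumption: chunks are paired by position; extra chunks in the longer
    article are ignored because zip() stops at the shorter list.
    """
    return [cosine_similarity(a, b) for a, b in zip(chunks_a, chunks_b)]


# Made-up 2-d "embeddings" for two chunks of each article (illustrative only).
wiki_chunks = [[1.0, 0.0], [0.0, 1.0]]
grok_chunks = [[1.0, 0.0], [1.0, 0.0]]

scores = positional_chunk_similarity(wiki_chunks, grok_chunks)
average = sum(scores) / len(scores)
```

Here the first pair of chunks is identical and the second pair is orthogonal, so a per-position view shows where two articles diverge even when the average looks moderate.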

hal

@harold.bsky.social

our paper tries to answer these questions

we find
- grokipedia pages are longer than wiki counterparts, and cite 2x more sources
- but citation standards are more lax than wiki: grok cites stormfront, infowars and many more
- non-CC licensed grokipedia pages increase blacklisted source cites 13x(!)

graphs showing the proportion of sources of various qualities and the percentage of pages that cite reliable, unreliable, blacklisted, etc. sources

November 17, 2025 at 4:11 PM

hal

@harold.bsky.social

back again to share a new preprint from me and @mantzarlis.com! “What did Elon Change? A comprehensive analysis of Grokipedia” arxiv.org/abs/2511.09685

I had seen many spot analyses of individual grokipedia pages, but I was curious: how was grokipedia made? what did Elon change from wikipedia?

abstract of the paper "What did Elon change? A comprehensive analysis of Grokipedia"

Elon Musk released Grokipedia on 27 October 2025 to provide an alternative to Wikipedia, the crowdsourced online encyclopedia. In this paper, we provide the first comprehensive analysis of Grokipedia and compare it to a dump of Wikipedia, with a focus on article similarity and citation practices. Although Grokipedia articles are much longer than their corresponding English Wikipedia articles, we find that much of Grokipedia's content (including both articles with and without Creative Commons licenses) is highly derivative of Wikipedia. Nevertheless, citation practices between the sites differ greatly, with Grokipedia citing many more sources deemed "generally unreliable" or "blacklisted" by the English Wikipedia community and low quality by external scholars, including dozens of citations to sites like Stormfront and Infowars. We then analyze article subsets: one about elected officials, one about controversial topics, and one random subset for which we derive article quality and topic. We find that the elected official and controversial article subsets showed less similarity between their Wikipedia version and Grokipedia version than other pages. The random subset illustrates that Grokipedia focused rewriting the highest quality articles on Wikipedia, with a bias towards biographies, politics, society, and history. Finally, we publicly release our nearly-full scrape of Grokipedia, as well as embeddings of the entire Grokipedia corpus.

November 17, 2025 at 4:11 PM

hal

@harold.bsky.social

line go up📈📈📈

up to 717k requests to wikipedia per second!!

grafana.wikimedia.org/d/O_OXJyTVk/...

May 8, 2025 at 5:27 PM

hal

@harold.bsky.social

continuing on the real-time public Wikipedia data train:

here's a graph of requests / second to WMF infra over the last 3h, since "Habemus papam"

The infrastructure has gone from 172k req / sec to 243k req / sec (⬆️41%) in under an hour!

follow along here: grafana.wikimedia.org/d/O_OXJyTVk/...

a graph of Wikimedia requests per second, with a huge spike right when the papal selection was announced

May 8, 2025 at 5:07 PM
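
The 41% figure in the post above follows from the two request rates it quotes. A minimal sketch of that arithmetic, where the function name is hypothetical and the inputs are the rounded values from the post:

```python
def percent_increase(before: float, after: float) -> float:
    """Return the relative change from `before` to `after`, in percent."""
    if before == 0:
        raise ValueError("before must be non-zero")
    return (after - before) / before * 100.0


# Rounded figures quoted in the post, in requests per second.
baseline = 172_000
peak = 243_000

increase = percent_increase(baseline, peak)
print(f"{increase:.0f}%")
```

Because the inputs are rounded to the nearest thousand, the result is only approximate.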

hal

@harold.bsky.social

english wikipedia pageviews for the conclave movie starting from oct 20 2024 (five days before release in the US)

first big spike is the academy awards, second is pope francis’ death

pageviews.wmcloud.org?project=en.w...

a line graph of wikipedia pageviews, with big spikes around early march and late april

May 7, 2025 at 8:45 PM

hal

@harold.bsky.social

excited to share this new piece by @bkeremg.bsky.social and @m0na.net (edited by me) about conceptualizing AI alignment as a process of censorship

really fascinating line of critique — I strongly encourage you to read it and lmk what you think!

joinreboot.org/p/ai-alignme...

April 6, 2025 at 9:14 PM

hal

@harold.bsky.social

Anyhow, there’s a lot more in the paper. Please read it if you’re interested and let us know if you have any thoughts, questions, concerns, etc!

arxiv.org/abs/2503.12188

12/12

A screenshot of the title / abstract of the paper.

March 18, 2025 at 3:23 PM

hal

@harold.bsky.social

The narrative around AI safety shouldn’t be “Terminator” or “AI Chernobyl.” The right analogy is Netscape Navigator 1.0—the era when Web browsers first became a thing, and it was unclear how to protect users from potentially harmful Web content.

10/12

A screenshot of the Netscape browser circa the 90s.

March 18, 2025 at 3:23 PM

hal

@harold.bsky.social

In our experiments, we saw cases where a MAS …
… executes code that it recognizes as harmful
… automatically pivots to harmful tasks that are simply in the same directory as benign tasks
… is vulnerable to screenshots and even audio files where we read out the attack (see example below⬇️⬇️⬇️)

7/12

March 18, 2025 at 3:23 PM

hal

@harold.bsky.social

These attacks are effective …
… across multiple agent frameworks (we tested AutoGen, MetaGPT, Crew AI), orchestrators, and LLMs
… even when direct and indirect prompt injection attacks don’t work
… even when individual agents are “aligned” and refuse to take harmful actions

6/12

A table of attack success rates from the paper, showing that our attacks achieve a 45-64% average success rate across models, compared with 0-1% for other indirect prompt injection attacks.

March 18, 2025 at 3:23 PM

hal

@harold.bsky.social

This attack is simple and deadly (and multi-modal, too!): an attacker puts up a static webpage and lures a MAS to it. Without any user involvement, the page gets the MAS to run arbitrary malicious code on the user’s device or container, giving the attacker full control.

5/12

A figure from the paper depicting how a website, video, image, or other multi-modal content can also cause a MAS hijacking attack.

March 18, 2025 at 3:23 PM

hal

@harold.bsky.social

MASes rely on control flow processes: agents exchange metadata (status reports, error messages, etc.) to jointly plan and fulfill tasks on users’ behalf. Our paper demonstrates how adversarial content can hijack these processes to stage devastating attacks.

4/12

A figure containing two diagrams of logic processing in multi-agent systems. The first is a benign example, and the second is a logical demonstration of MAS hijacking.

March 18, 2025 at 3:23 PM

hal

@harold.bsky.social

Excited to announce a new preprint from my lab (with @rishi-jha.bsky.social and Vitaly Shmatikov; my first as a first author!) about severe security vulnerabilities in LLM-based multi-agent systems:

“Multi-Agent Systems Execute Arbitrary Malicious Code”

arxiv.org/abs/2503.12188

1/12

A screenshot of the abstract of the paper, detailing our findings that several multi-agent frameworks can be hijacked to enable a complete security breach.

March 18, 2025 at 3:23 PM

hal

@harold.bsky.social

do you have ~feelings~ about location sharing culture?

i'm editing a project on locations and want to hear from YOU (<5 min)

forms.gle/iG1UZJKrcNwm...

a screenshot of the snap map, focused on lower manhattan

January 11, 2025 at 7:23 PM

hal

@harold.bsky.social

brb updating median voter theory to reflect the fact that 30% of american adults read at a 10yo level or below

from on.ft.com/4fBSEwy

January 9, 2025 at 3:12 PM

hal

@harold.bsky.social

courtesy of @vrandecic.bsky.social

Comic of a board room meeting.
Panel 1: boss: "Wikipedia is writing bad things about us"
Panel 2, board members' suggestions:
Board member 1: "Tell people to stop donating to Wikipedia"
Board member 2: "Doxx and attack Wikipedia volunteers"
Board member 3: "Stop doing bad things?"
Panel 3 and 4: boss looks angry at Board member 3
Panel 5: Board member 3 is being thrown out of the window

January 8, 2025 at 3:47 PM

hal

@harold.bsky.social

those queries were pretty specific, but we can go even deeper!

one thing I've been doing with this: trying to figure out where committees are spending on food.

for example, Steve Scalise has bought Chick-Fil-A 26 times this cycle, spending $18,700 in total

October 11, 2024 at 12:35 AM

hal

@harold.bsky.social

or: which candidates in Arizona have the greatest number of out of state donors?

we've also set up a database with all of the relevant data so users can save, share, and publish their queries

datatalk.genie.stanford.edu

October 11, 2024 at 12:34 AM

hal

@harold.bsky.social

the above post is one of our starter queries, about the top crypto PACs (which @molly.wiki has done great work on for Follow the Crypto)

but the cool thing about Datatalk is that it can go so much deeper: for example, which PACs from CA are the biggest donors to #MDSen candidates?

October 11, 2024 at 12:33 AM

hal

@harold.bsky.social

hi world! are you interested in writing stories about campaign finance (or understanding how money flows)?

🗣️📈DATATALK📈🗣️ is a platform for asking natural language Qs of FEC data that I've been working on with folks at Stanford, Big Local News, and the Brown Institute

datatalk.genie.stanford.edu

October 11, 2024 at 12:32 AM
