paper: arxiv.org/abs/2511.09685
github: github.com/htried/wiki-...
huggingface: huggingface.co/datasets/htr...
paper: arxiv.org/abs/2511.09685
github: github.com/htried/wiki-...
huggingface: huggingface.co/datasets/htr...
please give it a read and let me know what you think!
please give it a read and let me know what you think!
- an X user asks the Grok chatbot something, then publishes the answer
- Grokipedia *cites that answer* without noting that it is a chatbot output
(the attached images are real examples of this)
- an X user asks the Grok chatbot something, then publishes the answer
- Grokipedia *cites that answer* without noting that it is a chatbot output
(the attached images are real examples of this)
- grokipedia also targeted the wiki articles deemed highest quality for rewrites: the "featured article" and "good article" classes
- grokipedia also targeted the wiki articles deemed highest quality for rewrites: the "featured article" and "good article" classes
- many grokipedia pages (including those without cc licenses) are basically identical to their wiki counterparts, especially short ones
- many grokipedia pages (including those without cc licenses) are basically identical to their wiki counterparts, especially short ones
we find
- grokipedia pages are longer than wiki counterparts, and cite 2x more sources
- but citation standards are more lax than wiki: grok cites stormfront, infowars and many more
- non-CC licensed grokipedia pages increase blacklisted source cites 13x(!)
we find
- grokipedia pages are longer than wiki counterparts, and cite 2x more sources
- but citation standards are more lax than wiki: grok cites stormfront, infowars and many more
- non-CC licensed grokipedia pages increase blacklisted source cites 13x(!)
I had seen many spot analyses of individual grokipedia pages, but I was curious: how was grokipedia made? what did Elon change from wikipedia?
I had seen many spot analyses of individual grokipedia pages, but I was curious: how was grokipedia made? what did Elon change from wikipedia?
here's a graph of requests / second to WMF infra over the last 3h, since "Habemus papam"
The infrastructure has gone from 172k req / sec to 243k req / sec (⬆️41%) in under an hour!
follow along here: grafana.wikimedia.org/d/O_OXJyTVk/...
here's a graph of requests / second to WMF infra over the last 3h, since "Habemus papam"
The infrastructure has gone from 172k req / sec to 243k req / sec (⬆️41%) in under an hour!
follow along here: grafana.wikimedia.org/d/O_OXJyTVk/...
first big spike is the academy awards, second is pope francis’ death
pageviews.wmcloud.org?project=en.w...
first big spike is the academy awards, second is pope francis’ death
pageviews.wmcloud.org?project=en.w...
really fascinating line of critique — I strongly encourage you to read it and lmk what you think!
joinreboot.org/p/ai-alignme...
really fascinating line of critique — I strongly encourage you to read it and lmk what you think!
joinreboot.org/p/ai-alignme...
arxiv.org/abs/2503.12188
12/12
arxiv.org/abs/2503.12188
12/12
10/12
10/12
… executes code that they recognize as harmful
… automatically pivots to harmful tasks that are simply in the same directory as benign tasks
… is vulnerable to screenshots and even audio files where we read out the attack (see example below⬇️⬇️⬇️)
7/12
… executes code that they recognize as harmful
… automatically pivots to harmful tasks that are simply in the same directory as benign tasks
… is vulnerable to screenshots and even audio files where we read out the attack (see example below⬇️⬇️⬇️)
7/12
… across multiple agent frameworks (we tested AutoGen, MetaGPT, Crew AI), orchestrators, and LLMs
… even when direct and indirect prompt injection attacks don’t work
… even when individual agents are “aligned” and refuse to take harmful actions
6/12
… across multiple agent frameworks (we tested AutoGen, MetaGPT, Crew AI), orchestrators, and LLMs
… even when direct and indirect prompt injection attacks don’t work
… even when individual agents are “aligned” and refuse to take harmful actions
6/12
5/12
5/12
4/12
4/12
“Multi-Agent Systems Execute Arbitrary Malicious Code”
arxiv.org/abs/2503.12188
1/12
“Multi-Agent Systems Execute Arbitrary Malicious Code”
arxiv.org/abs/2503.12188
1/12
i'm editing a project on locations and want to hear from YOU (<5 min)
forms.gle/iG1UZJKrcNwm...
i'm editing a project on locations and want to hear from YOU (<5 min)
forms.gle/iG1UZJKrcNwm...
from on.ft.com/4fBSEwy
from on.ft.com/4fBSEwy
one thing I've been doing with this: trying to figure out where committees are spending on food.
for example, Steve Scalise has bought Chick-Fil-A 26 times this cycle, spending $18,700 in total
one thing I've been doing with this: trying to figure out where committees are spending on food.
for example, Steve Scalise has bought Chick-Fil-A 26 times this cycle, spending $18,700 in total
we've also set up a database with all of the relevant data so users can save, share, and publish their queries
datatalk.genie.stanford.edu
we've also set up a database with all of the relevant data so users can save, share, and publish their queries
datatalk.genie.stanford.edu
but the cool thing about Datatalk is that it can go so deeper — for example, which PACs from CA are the biggest donors to #MDSen candidates?
but the cool thing about Datatalk is that it can go so deeper — for example, which PACs from CA are the biggest donors to #MDSen candidates?
🗣️📈DATATALK📈🗣️ is a platform for asking natural language Qs of FEC data that I've been working on with folks at Stanford, Big Local News, and the Brown Institute
datatalk.genie.stanford.edu
🗣️📈DATATALK📈🗣️ is a platform for asking natural language Qs of FEC data that I've been working on with folks at Stanford, Big Local News, and the Brown Institute
datatalk.genie.stanford.edu