Lightnews — Scholar-powered news

Reposted by Alberto Hernández Marcos

César Astudillo

@cesarastudillo.bsky.social

Este artículo me ha parecido muy provocador y me ha abierto muchas dudas. Quizá un feminismo plural y dialogante pueda tener conversaciones incómodas y constructivas con una fracción de los hombres objetivo del discurso manosférico, pero temo que para los propios manosféricos ya sea tarde para eso

Fundación de los Comunes @fundacomunes.bsky.social · Jul 17

"La masculinidad se ha convertido en una fuente de gran sufrimiento, tanto para hombres como para mujeres. Entender esto no es sólo comprender su crisis global, sino también vislumbrar una posibilidad de solución"

En otro imprescindible de @nuriaalabao.bsky.social @ctxt.es

ctxt.es/es/20250701/...

Activistas por los derechos de los hombres

La influencia de la manosfera sobre los jóvenes tiene que ser compensada con un diálogo abierto donde sea posible debatir de todo

ctxt.es

July 19, 2025 at 6:23 AM

Reposted by Alberto Hernández Marcos

Luis Saiz

@lsaiz.bsky.social

Rubén Santamarta ha venido publicado análisis fundamentados sobre el apagón

Ahora está pidiendo logs de los que tengáis fotovoltaica conectada a red

www.linkedin.com/posts/rubens...

cc/
@mjelectriz.bsky.social @revenergetica.bsky.social @todoselectricos.bsky.social
@pacovalverde.bsky.social

Me gustaría apelar a vuestra colaboración para poder profundizar en el análisis ciber-físico del papel de los inversores solares de autoconsumo en el apagón. | Ruben Santamarta

Me gustaría apelar a vuestra colaboración para poder profundizar en el análisis ciber-físico del papel de los inversores solares de autoconsumo en el apagón. Los que me seguís ya sabéis que he estado...

www.linkedin.com

June 21, 2025 at 8:04 PM

Reposted by Alberto Hernández Marcos

Ethan Mollick

@emollick.bsky.social

A big AI question is why, as LLMs get bigger, their values seem to increasingly converge on the same preferences, this holds for Musk’s Grok & China’s DeepSeek, too.

“These findings suggest that value systems emerge in LLMs in a meaningful sense, with broad implications” arxiv.org/abs/2502.08640

June 15, 2025 at 3:56 PM

Reposted by Alberto Hernández Marcos

Carl T. Bergstrom

@carlbergstrom.com

Ok, time for a short thread about this paper.

My sense over the past six months or so is that chain-of-thought prompting as used in e.g. ChatGPT o.3 improves substantially upon previous systems such as ChatGPT 4.o, at least for certain tasks.

But how revolutionary is it?

Carl T. Bergstrom @carlbergstrom.com · Jun 8

If I have time I'll put together a more detailed thread tomorrow, but for now, I think this new paper about limitations of Chain-of-Thought models could be quite important. Worth a look if you're interested in these sorts of things.

ml-site.cdn-apple.com/papers/the-i...

The Illusion of Thinking:
Understanding the Strengths and Limitations of Reasoning Models
via the Lens of Problem Complexity
Parshin Shojaee∗† Iman Mirzadeh∗ Keivan Alizadeh
Maxwell Horton Samy Bengio Mehrdad Farajtabar
Apple
Abstract
Recent generations of frontier language models have introduced Large Reasoning Models
(LRMs) that generate detailed thinking processes before providing answers. While these models
demonstrate improved performance on reasoning benchmarks, their fundamental capabilities, scal-
ing properties, and limitations remain insufficiently understood. Current evaluations primarily fo-
cus on established mathematical and coding benchmarks, emphasizing final answer accuracy. How-
ever, this evaluation paradigm often suffers from data contamination and does not provide insights
into the reasoning traces’ structure and quality. In this work, we systematically investigate these
gaps with the help of controllable puzzle environments that allow precise manipulation of composi-
tional complexity while maintaining consistent logical structures. This setup enables the analysis
of not only final answers but also the internal reasoning traces, offering insights into how LRMs
“think”. Through extensive experimentation across diverse puzzles, we show that frontier LRMs
face a complete accuracy collapse beyond certain complexities. Moreover, they exhibit a counter-
intuitive scaling limit: their reasoning effort increases with problem complexity up to a point, then
declines despite having an adequate token budget. By comparing LRMs with their standard LLM
counterparts under equivalent inference compute, we identify three performance regimes: (1) low-
complexity tasks where standard models surprisingly outperform LRMs, (2) medium-complexity
tasks where additional thinking in LRMs demonstrates advantage, and (3) high-complexity tasks
where both models experience complete collapse. We found that LRMs have limitations in exact
computation: they fail to use explicit …

June 9, 2025 at 3:59 AM

Reposted by Alberto Hernández Marcos

Dean Baker

@deanbaker13.bsky.social

I strongly second Krugman's "letter to Europe." The EU should tell Trump to take his tariff and shove it. Hitting U.S. consumers with a huge tax increase is not smart policy, but as a reality TV show star, what does Trump know about economics? paulkrugman.substack.com/p/a-letter-t...

A Letter to Europe

You’re stronger than you think. Act like it.

paulkrugman.substack.com

May 27, 2025 at 11:49 AM

Reposted by Alberto Hernández Marcos

Ethan Mollick

@emollick.bsky.social

Individuals keep self-reporting huge gains in productivity from AI & controlled experiments in many industries keep finding these boosts are real, yet most firms are not seeing big effects. Why?

Because gaining from AI requires organizational innovation. www.oneusefulthing.org/p/making-ai-...

Making AI Work: Leadership, Lab, and Crowd

A formula for AI in companies

www.oneusefulthing.org

May 22, 2025 at 2:32 PM

Reposted by Alberto Hernández Marcos

Ethan Mollick

@emollick.bsky.social

Big: The final version of a randomized, controlled World Bank study finds using a GPT-4 tutor with teacher guidance in a six week afterschool program in Nigeria had "more than twice the effect of some of the most effective interventions in education" ("equating to 1.5 to 2 years" of standard school)

May 20, 2025 at 8:07 PM

Reposted by Alberto Hernández Marcos

Ethan Mollick

@emollick.bsky.social

I wish these skeptical AI articles (this is from the NYTimes) would actually grapple with the growing body of research that AI can do original research & perform key unstructured tasks across the spectrum of high-end white collar employment.

AI criticism is important, but it should be clear-eyed.

May 16, 2025 at 5:00 PM

Reposted by Alberto Hernández Marcos

Ethan Mollick

@emollick.bsky.social

A common question is "can an AI make money?"

This benchmark, where AIs run a simulated vending machine over time, suggests yes, with an important caveat

On average, Claude 3.5 & o3-mini beat a human, but are high in variance & fail at random times for complex reasons. andonlabs.com/evals/vendin...

May 10, 2025 at 3:44 AM

Reposted by Alberto Hernández Marcos

Ethan Mollick

@emollick.bsky.social

"Our findings demonstrate that reasoning models improve not only the clarity, organization, and professionalism of legal work but also the depth & rigor of legal analysis itself."

Law students using o1-preview had the quality of their work on most tasks increase (up to 28%) & time savings of 12-28%

May 8, 2025 at 2:28 AM

Reposted by Alberto Hernández Marcos

Ethan Mollick

@emollick.bsky.social

I just don’t see signs of a major increase in hallucination rates for recent models, or for reasoners overall,
in the data.

It seems like some models do better than others, but many of the recent models have the lowest hallucination rates.

May 6, 2025 at 9:06 PM

Reposted by Alberto Hernández Marcos

Melanie Mitchell

@melaniemitchell.bsky.social

I'm looking forward to this event next week in Amsterdam!

Abel Jansma @abelaer.bsky.social · May 6

Next week we're organising a workshop on the role of analogies in (artificial) intelligence, with:

Melanie Mitchell (@melaniemitchell.bsky.social), Martha Lewis, Jules Hedges (‪@julesh.mathstodon.xyz.ap.brid.gy‬), and Han van der Maas.

Register here: www.d-iep.org/workshopanal...

WORKSHOPANALOGIES | DIEP

www.d-iep.org

May 6, 2025 at 10:48 PM

Reposted by Alberto Hernández Marcos

ruggsea

@ruggsea.bsky.social

Reposting from the other site to spread it here: apparently, thinking that RLHF irons out creativity in LLMs is now corroborated by this paper arxiv.org/pdf/2505.00047

May 6, 2025 at 11:45 PM

Reposted by Alberto Hernández Marcos

Keezy Young🌼

@keezyyoung.bsky.social

the specific laughter of a little kid who is being taken on a ride of some kind (tricycle, plastic car, thrown up in the air by a dad etc) is one of the most precious and valuable things you'll ever hear

May 7, 2025 at 1:35 AM

Reposted by Alberto Hernández Marcos

Melanie Mitchell

@melaniemitchell.bsky.social

Karpathy: We have reached "jagged Intelligence"

Mollick: We have reached "jagged AGI"

Next up: "jagged consciousness"?

www.oneusefulthing.org/p/on-jagged-...

On Jagged AGI: o3, Gemini 2.5, and everything after

New models and new thresholds

www.oneusefulthing.org

May 2, 2025 at 4:57 PM

Reposted by Alberto Hernández Marcos

César Astudillo

@cesarastudillo.bsky.social

A quienes teníais pensado acudir: por desgracia habrá que aplazar el evento porque la UCM ha suspendido todas las actividades. Se establecerá una nueva fecha a corto plazo y por supuesto os la contaré.

Píxel Sonoro @pixelsonoro.bsky.social · Apr 28

🎵🎶¡Mañana celebramos este evento en homenaje a la obra de @cesarastudillo.bsky.social y @riskwood.bsky.social
donde profundizaremos en la creación musical durante la "edad de oro"

✅Exposiciones
✅Interpretación de arreglos orquestales de algunas de sus piezas

¿Cómo puedes seguirlo? ⬇️⬇️

April 29, 2025 at 7:41 AM

Reposted by Alberto Hernández Marcos

Gus

@gusthema.bsky.social

Gemma 3 are just amazing models!

but what if you want to manipulate it's internal activations to understand how it does its text generation?

Sascha Rothe is here to teach you how!

Great insights for anyone curious about the inner workings of LLMs!

www.youtube.com/watch?v=JTUs...

Inside Gemma 3: Modifying the output through activation hacking

YouTube video by Google for Developers

www.youtube.com

April 28, 2025 at 1:57 PM

Reposted by Alberto Hernández Marcos

Ethan Mollick

@emollick.bsky.social

👀Today’s AIs are already hyper persuasive.

A controversial study where LLMs tried to persuade users on Reddit found that: “Notably, all our treatments surpass human performance substantially, achieving persuasive rates between three and six times higher than the human baseline.”

April 28, 2025 at 5:20 PM

Reposted by Alberto Hernández Marcos

Píxel Sonoro

@pixelsonoro.bsky.social

🎵🎶¡Mañana celebramos este evento en homenaje a la obra de @cesarastudillo.bsky.social y @riskwood.bsky.social
donde profundizaremos en la creación musical durante la "edad de oro"

✅Exposiciones
✅Interpretación de arreglos orquestales de algunas de sus piezas

¿Cómo puedes seguirlo? ⬇️⬇️

April 28, 2025 at 8:39 AM

Reposted by Alberto Hernández Marcos

Phillip Carter

@phillipcarter.dev

I have yet to see an "intro to AI" video as comprehensive but also approachable as Andrej Karpathy's 1hr overview. I maintain that anyone who watches this (and pays attention the whole time) will come away with an intuitive understanding of how to use LLMs www.youtube.com/watch?v=zjkB...

[1hr Talk] Intro to Large Language Models

YouTube video by Andrej Karpathy

www.youtube.com

April 10, 2025 at 1:14 AM

Reposted by Alberto Hernández Marcos

Kate Beaton

@katebeaton.bsky.social

Richard Serra lived seasonally in our village so we went to the Reina Sofia to see their room of his work, but the children had no love for grand abstract minimalist rectangles, and we were forced to depart after much whining, it was a quixotic quest anyway

A person standing in front of a Richard Serra artwork

April 12, 2025 at 6:59 AM

Reposted by Alberto Hernández Marcos

Ethan Mollick

@emollick.bsky.social

If you wanted to see how little attention folks are paying to the possibility of AGI (however defined) no matter how much the labs publicly discuss it, here is an official course from Google Deepmind whose first session is "we are on a path to superhuman capabilities"

It has less than 1,000 views.

April 3, 2025 at 3:05 PM

Alberto Hernández Marcos

@alberto-h.bsky.social

And this applies to so many other approaches, like building-my-own-RAG...

Ethan Mollick @emollick.bsky.social · Apr 2

The fast evolution of AIs makes it risky to spend tons of effort getting around current model limits through clever approaches, rather than waiting.

I trained a LoRA 6 months ago to get an image of a Wired Magazine cover, now I can just ask GPT-4o to make it.

Wonder if this was a trap for Apple.

April 2, 2025 at 4:34 PM

Reposted by Alberto Hernández Marcos

Serge Belongie

@serge.belongie.com

Would you present your next NeurIPS paper in Europe instead of traveling to San Diego (US) if this was an option? Søren Hauberg (DTU) and I would love to hear the answer through this poll: (1/6)