Author | Lightnews

Reposted by Lukas N.P. Egger

@shriram.bsky.social

@stephaniemlee.bsky.social has written a fine piece about the NeurIPS high school AI contest!
www.chronicle.com/article/teen...

Teens Are Doing AI Research Now. Is That a Good Thing?

No need for a Ph.D.: At artificial intelligence’s biggest conference, high schoolers competed to present.

www.chronicle.com

January 15, 2025 at 1:55 AM

Reposted by Lukas N.P. Egger

Marcelo Calbucci

@calbucci.com

I wish OpenAI would focus on playing the orchestration layer among all my apps/services, than to become one more place for me to manage tasks. www.theverge.com/2025/1/14/24...

January 14, 2025 at 10:18 PM

Reposted by Lukas N.P. Egger

hardmaru

@hardmaru.bsky.social

I think in the future, the line between “pre-training” and “post-training” will be gone, and our models and agents will continuously adapt and self-improve.

January 15, 2025 at 5:57 AM

Reposted by Lukas N.P. Egger

Daniel Eth (yes, Eth is my actual last name)

@daniel-eth.bsky.social

Every now & then I come across this view, and my reaction is - why? We’ve developed AI systems that can converse & reason and that can drive vehicles w/o an understanding at level of fundamental principles, why should AGI require it? Esp since the whole point of ML is the system learns in training

January 6, 2025 at 5:13 AM

Reposted by Lukas N.P. Egger

Daniel P. Moriarity

@dpmoriarity.bsky.social

"A study of federally funded research projects in the United States estimated that principal investigators spend on average about 45% of their time on administrative activities related to applying for and managing projects rather than conducting active research"

www.pnas.org/doi/10.1073/...

January 4, 2025 at 1:26 PM

Reposted by Lukas N.P. Egger

Sung Kim

@sungkim.bsky.social

Apparently, you can run DeepSeek-V3 locally, provided that you have 8 M4 Pro 64GB Mac minis.

~5 tok/sec.

December 27, 2024 at 3:03 AM

Lukas N.P. Egger

@brusik.bsky.social

Just because AI is highly capable doesn’t mean one can immediately make use of it. Jobs will need to be reorganized before any serious automation will take hold aka. process re-engineering.

I expect to see many more visualizations like this in the near future.

December 25, 2024 at 5:31 PM

Reposted by Lukas N.P. Egger

Grace

@gracekind.net

December 24, 2024 at 8:52 PM

Reposted by Lukas N.P. Egger

Ryan Moulton

@moultano.bsky.social

Ok, this genuinely freaks me out. I had thought this had taken longer.
x.com/_jasonwei/st...

Jason Wei tweeting "o3 is very performant. More importantly, progress from o1 to o3 was only three months, which shows how fast progress will be in the new paradigm of RL on chain of thought to scale inference compute. Way faster than pretraining paradigm of new model every 1-2 years"

December 20, 2024 at 10:29 PM

Reposted by Lukas N.P. Egger

Melanie Mitchell

@melaniemitchell.bsky.social

From @fchollet.bsky.social over on the other site.

"they demand serious scientific attention" -- indeed! But how to do science on these results without more openness?

December 20, 2024 at 6:15 PM

Reposted by Lukas N.P. Egger

Ethan Mollick

@emollick.bsky.social

Independent evaluations of OpenAI’s o3 suggest that it passed math & reasoning benchmarks that were previously considered far out of reach for AI including achieving a score on ARC-AGI that was associated with actually achieving AGI (though the creators of the benchmark don’t think it o3 is AGI)

December 20, 2024 at 6:26 PM

Reposted by Lukas N.P. Egger

Andrew Lampinen

@lampinen.bsky.social

Very cool to see the progress on a variety of logic/math/other reasoning tasks with LMs! I especially appreciate that the researchers are being explicit that it's *just* an LM + RL (x.com/__nmca__/sta...). A few reflections: 1/5

x.com

December 20, 2024 at 7:21 PM

Reposted by Lukas N.P. Egger

Soumith Chintala

@soumithchintala.bsky.social

intelligence is starting to get good, but context is still siloed for stupid reasons.
get models that do human-level computer-use already, please...!

December 20, 2024 at 8:59 PM

Reposted by Lukas N.P. Egger

hardmaru

@hardmaru.bsky.social

I think GANs are going to come back in a big way in 2025, but with a twist. History doesn’t repeat itself but it often rhymes.

December 18, 2024 at 1:39 AM

Reposted by Lukas N.P. Egger

Emily Hunt

@emily.space

What could possibly go wrong? The latest OpenAI model sometimes actively tries to deceive users:

OpenAI's o1 model sure tries to deceive humans a lot | TechCrunch

OpenAI finally released the full version of o1, which gives smarter answers than GPT-4o by using additional compute to "think" about questions. However,

techcrunch.com

December 6, 2024 at 9:07 AM

Reposted by Lukas N.P. Egger

Ethan Mollick

@emollick.bsky.social

Models like o1 suggest that people won’t generally notice AGI-ish systems that are better than humans at most intellectual tasks, but which are not autonomous or self-directed

Most folks don’t regularly have a lot of tasks that bump up against the limits of human intelligence, so won’t see it

December 7, 2024 at 12:49 AM

Reposted by Lukas N.P. Egger

Yisong Yue

@yisongyue.bsky.social

I've been around the block a few times. When deep learning first became hot, many older colleagues bemoaned it as just tinkering + chain rule, and not intellectually satisfying. Then came SSL, equivariance, VAEs, GANs, neural ODEs, transformers, diffusion, etc. The richness was staggering.

🧵👇

Vicki @vickiboykis.com · Dec 6

Great post that captures the tension between classic ML approaches and modern deep learning while acknowledging the nuances of both.

“Working with LLMs doesn’t feel the same. It’s like fitting pieces into a pre-defined puzzle instead of building the puzzle itself.”

www.reddit.com/r/MachineLea...

From the MachineLearning community on Reddit

Explore this post and more from the MachineLearning community

www.reddit.com

December 6, 2024 at 5:06 PM

Lukas N.P. Egger

@brusik.bsky.social

We haven’t seen an interesting new take on GTM and it is my firm believe that we need one, badly.

My current best guess is that we will go towards something like labor replacement cost, a negative seats subscription if you will.

December 6, 2024 at 7:34 PM

Lukas N.P. Egger

@brusik.bsky.social

And tokens or consumption based pricing also has peculiar misaligned incentives, e.g., adversity to adoption of better LLMs or complete unpredictability of floor pricing.

December 6, 2024 at 7:34 PM

Lukas N.P. Egger

@brusik.bsky.social

SaaS assumes (mostly) that a $/seat license makes sense which isn’t working either but for less obvious reasons.

December 6, 2024 at 7:34 PM

Lukas N.P. Egger

@brusik.bsky.social

Outcome based pricing is always interesting during dramatic change because the deltas are huge. Some smaller firms can navigate the difficulties but it is the wrong answer for most because the indicators are lagging badly and the system can easily be gamed. Too tough to pull off.

December 6, 2024 at 7:34 PM

Reposted by Lukas N.P. Egger

Ethan Mollick

@emollick.bsky.social

Here is a fun o1 test. I gave it this XKCD comic & the prompt: "make this a reality. i need a gui and clear instructions since i can't code. that means you need to give me full working software"

It took less than 15 minutes and it didn't get caught in any of the usual LLM loops, just solved issues.

December 5, 2024 at 6:55 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news