Scott Condron
scottcondron.bsky.social
Working at wandb on Weave, helping teams ship AI applications
How do I get Bluesky to show me less politics and more AI/ML things? I have followed mostly people who work in AI/ML
March 9, 2025 at 11:12 AM
Prompts within a complex system are brittle

I've seen some teams succeed by replacing prompts with smaller, more deterministic components and by improving reliability with fine-tuning. Anyone else have success with this approach?

Seems to help a lot with agents
November 29, 2024 at 10:16 AM
Reposted by Scott Condron
I collected some folk knowledge for RL and stuck them in my lecture slides a couple weeks back: web.mit.edu/6.7920/www/l... See Appendix B... sorry, I know, appendix of a lecture slide deck is not the best for discovery. Suggestions very welcome.
November 27, 2024 at 1:36 PM
If you’re taking time to enjoy your family and not building with LLMs, you’re ngmi.
America is cooked
November 28, 2024 at 7:01 AM
LLM app dev broke our comparison tools because tiny diffs can cause large behaviour changes.

At wandb, we've spent years thinking about experiment comparison. We've added new tools for LLM app dev: code, prompts, models, configs, outputs, eval metrics, eval predictions, eval scores…
wandb.me/weave
November 26, 2024 at 1:38 PM
The art of referring to model behaviour with tasteful non-person metaphors. Say "stochastic" and you're in one camp; say "emergent" and you're in another.
It’s a minefield out there people
I've been very resistant to those in the past, but I've come round to it now

Learning to use LLMs really is a whole lot easier if you apply "person" metaphors to them

I trust people to figure out that they're not sci-fi AI entities once they really start digging in and using them
November 25, 2024 at 8:54 PM
Reposted by Scott Condron
Being logged into wandb on your phone is a recipe for misery
November 20, 2024 at 4:09 AM
Lessons from creating an llms.txt file
An llms.txt file is a way to tell an LLM about your website. In the .txt file, you include links to other files with info to learn more.
- the llms.txt file isn't the file you send to an LLM; you use it to generate an llms.md file
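A minimal llms.txt sketch, following the commonly used format (an H1 title, a blockquote summary, then H2 sections of markdown links; the project name and URLs below are invented for illustration):

```
# Example Project

> One-sentence summary of what this site is about, for an LLM to read first.

## Docs

- [Quickstart](https://example.com/quickstart.md): how to install and run
- [API reference](https://example.com/api.md): full endpoint listing

## Optional

- [Changelog](https://example.com/changelog.md): release history
```

A tool can then expand those links into a single llms.md file for the model's context.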
November 21, 2024 at 7:16 PM
Your human and LLM judges should follow the same criteria.

Then, you can transition from manual to automated evaluation once you have inter-annotator agreement between LLM & human. You now have a faster iteration speed and the annotator can focus on finding edge cases!
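The agreement check above can be sketched with a few lines of Python: given paired labels from a human and an LLM judge on the same eval examples (the labels below are hypothetical), compute raw percent agreement and Cohen's kappa to correct for chance.

```python
from collections import Counter

def percent_agreement(human, llm):
    """Fraction of examples where the human and LLM judge gave the same label."""
    assert len(human) == len(llm)
    return sum(h == l for h, l in zip(human, llm)) / len(human)

def cohens_kappa(human, llm):
    """Agreement corrected for chance agreement (Cohen's kappa, two raters)."""
    n = len(human)
    po = percent_agreement(human, llm)  # observed agreement
    h_counts, l_counts = Counter(human), Counter(llm)
    labels = set(human) | set(llm)
    # expected agreement if both raters labelled independently at their base rates
    pe = sum(h_counts[x] * l_counts[x] for x in labels) / (n * n)
    return (po - pe) / (1 - pe)

# Hypothetical pass/fail judgments on the same 8 eval examples.
human = ["pass", "pass", "fail", "pass", "fail", "fail", "pass", "pass"]
llm   = ["pass", "pass", "fail", "fail", "fail", "fail", "pass", "pass"]

print(percent_agreement(human, llm))  # 0.875 (7 of 8 match)
print(cohens_kappa(human, llm))       # 0.75
```

Once kappa is high enough for your taste, the LLM judge can take over the bulk labelling and humans can focus on the disagreements and edge cases.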
November 20, 2024 at 8:30 PM
Put glue on pizza
November 20, 2024 at 8:53 AM
The most bizarre AI interview I've ever done was at wandb when as usual I asked a candidate to build an AI classifier in any language/framework of their choice..

And they nonchalantly said "I'll write it in Redstone", to which I almost let loose a chuckle until...
November 19, 2024 at 10:16 PM
Claude defaults to concise responses when there's high demand: a clever way to smooth peaks
November 19, 2024 at 8:21 PM
Reposted by Scott Condron
We've been working on just that at @weightsbiases.bsky.social with Weave!

Weave is a lightweight LLM tracing and evaluations toolkit that focuses on letting you iterate fast and make sure that your production LLM-based application is not degrading when you change prompts or models!
November 18, 2024 at 5:41 PM