Author | Lightnews

Ian Bicking @ianbicking.org · 14h

There's no right answer, but there are _better_ answers. Really it's a discernment process for us to figure out: we don't want the LLM to just "be positive" or "be negative" but instead we have to articulate very carefully what we really do want. (Also we don't know what we want.)

1 1 1

Ian Bicking @ianbicking.org · 14h

Is there some universal rubric we should be applying before complimenting the user? Or just hand out compliments at some rate, like "this is in the top 25% of the user's ideas, gold star!" Or concentrate on good ideas and skim over the bad ones?

1

Ian Bicking @ianbicking.org · 14h

When I see some clever prompt to make the LLM stop being sycophantic I am reminded of this... the problem isn't really positivity (positivity is actually great!), but a lack of discernment. When it compliments something that doesn't deserve it. But what does "deserve it" even mean?

1

Ian Bicking @ianbicking.org · 14h

This comes up with all kinds of LLM discernment. If you think the LLM has a bias in one direction you can tweak the prompting, but you don't get to a "correct" discernment by just making sure the distribution looks correct. If you should accept 50% and reject 50%, it also matters which 50%

1

Ian Bicking @ianbicking.org · 14h

Thinking a little more about LLM sycophancy...

In general discernment is very hard to get right. You ask for critique and you'll get critique. You ask for a compliment and you'll get a compliment. There is no "just tell me the truth."

2

Ian Bicking @ianbicking.org · 2d

I don't really resent the rate limits, the underlying cost is real. But mostly I'm surprised that at $20/mo (and on the "auto" model setting) Cursor will happily grind for hours every day. And honestly I prefer its results for most coding tasks. (In this case I'm using Claude Code for non-code)

1

Ian Bicking @ianbicking.org · 2d

I didn't think I was even using Claude Code that much, and I hit a rate limit. The rate limit also blocks me from using the normal Claude chat interface, which is an interesting choice. OpenAI's Codex similarly conked out fairly early with a rate limit, meanwhile Cursor keeps going and going...

1

Ian Bicking @ianbicking.org · 3d

I'm not sure I understand the likely impact of at large seats...? I can imagine a bias towards higher-turnout voting segments. Is it more than that?

1

Reposted by Ian Bicking

Python Software Foundation @python.org · 3d

TLDR; The PSF has made the decision to put our community and our shared diversity, equity, and inclusion values ahead of seeking $1.5M in new revenue. Please read and share. pyfound.blogspot.com/2025/10/NSF-...
🧵

The official home of the Python Programming Language

www.python.org

130 2.7K 6.2K

Ian Bicking @ianbicking.org · 6d

I'm kind of enjoying watching it and trying to reconstruct the surreal tesseract world it embodies.

But I guess what strikes me is how banal the game is. The environment isn't any richer for the AI. Filled with stuff, but just a different kind of background noise.

1 1