_ - \. 🇺🇸
@crumb.bsky.social
https://hf.co/crumb | she / xe / it / fae / E / Ey
yessss, i'm going to be digesting this idea for a little while. this is a great reminder, thank you
December 20, 2025 at 7:18 PM
cause like then it'd be even more about the geometry of the space, and you could re-center / rotate the state somehow to show that it's the right shape... um, i don't have a real-world analog for this that doesn't make me sound really really schizophrenic
December 20, 2025 at 7:03 PM
in the attached paper, instead of determining the origin of "a text" like i am, it determines the origin of "the answer to a prompt", and the generator generates an answer to the prompt
December 20, 2025 at 7:01 PM
last night though i was toying with Fourier transforms and thinking maybe we should try to manufacture specific attractors instead of reaching specific states... maybe that's an "ignore till we get there" thing though
December 20, 2025 at 6:59 PM
i hope u see what i'm saying and don't think i'm on "we should train an agent to navigate a space." the idea is: we should train an agent to navigate any space, while only seeing a low-dimensional projection of the space and only applying forces to the environment through a low-dimensional action space
December 20, 2025 at 6:59 PM
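a minimal sketch of the "low-dimensional projection, low-dimensional actions" setup in the post above, assuming numpy; ProjectedEnv and its dimensions are made up for illustration, not an existing environment.

```python
# toy environment: the true state is high-dimensional, the agent only observes a fixed
# random projection of it, and its low-dimensional action is lifted back up as a force
import numpy as np

class ProjectedEnv:  # hypothetical name
    def __init__(self, dim=256, obs_dim=8, act_dim=8, seed=0):
        rng = np.random.default_rng(seed)
        self.P_obs = rng.normal(size=(obs_dim, dim)) / np.sqrt(dim)      # state -> observation
        self.P_act = rng.normal(size=(dim, act_dim)) / np.sqrt(act_dim)  # action -> force
        self.state = rng.normal(size=dim)
        self.target = rng.normal(size=dim)

    def observe(self):
        # the agent never sees self.state directly, only these projections
        return self.P_obs @ self.state, self.P_obs @ self.target

    def step(self, action):
        force = self.P_act @ action               # low-dim action lifted to the full space
        self.state = 0.95 * self.state + force    # simple damped linear dynamics
        dist = np.linalg.norm(self.state - self.target)
        return self.observe(), -dist, dist < 1.0  # observation, reward, "reached"
```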
yeah okay i'll put a pot of coffee on and try to get it running cause now i'm worked up about it again
December 20, 2025 at 6:59 PM
obv with a traditional reservoir it's different from a binary "reached" vs "didn't reach"; we'd have some threshold

i think though nearly Everything in the real world is reservoir-y, right, so navigating reservoir spaces would be super useful
December 20, 2025 at 6:59 PM
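a minimal sketch of the "threshold instead of exactly reached" idea with an echo-state-style reservoir, assuming numpy; the spectral radius, reservoir size, and the 1.0 threshold are arbitrary choices here, and a real navigator would pick inputs from the state instead of at random.

```python
# echo-state-style reservoir with a "close enough counts as reached" threshold
import numpy as np

rng = np.random.default_rng(0)
N, IN = 300, 4                                      # reservoir size, input (action) size
W = rng.normal(size=(N, N))
W *= 0.9 / np.max(np.abs(np.linalg.eigvals(W)))     # scale spectral radius below 1
W_in = rng.normal(size=(N, IN)) * 0.5

def step(x, u):
    # standard ESN update: x_{t+1} = tanh(W x_t + W_in u_t)
    return np.tanh(W @ x + W_in @ u)

x = np.zeros(N)
target = np.tanh(rng.normal(size=N))                # some proposed reservoir state
REACHED_EPS = 1.0                                   # threshold instead of exact equality

for t in range(100):
    u = rng.normal(size=IN)                         # placeholder for the navigator's action
    x = step(x, u)
    if np.linalg.norm(x - target) < REACHED_EPS:
        print("reached at step", t)
        break
```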
would you or wouldn't you give the navigator the exact state, if you wanted it to be the most useful? like, it sees how close it is to the right state half of the time, and the other half it has to deduce that itself from seeing the current state and the target?
December 20, 2025 at 6:59 PM
i think i'm overthinking it because this seems like something i could just implement right now and test in like google colab
i think an intelligent agent like this could also be used to modulate specific forces to confine plasma in new ways
December 20, 2025 at 6:59 PM
you'd make sure the states _are reachable_ but _hard to reach_, so maybe +1 for the proposer when the navigator eventually reaches the state, plus a reward proportional to the length of the trajectory the navigator needed? and the navigator gets a reward for reaching it, plus the inverse of that length term?
December 20, 2025 at 6:59 PM
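one way to write that reward split down, as a sketch; the exact shaping (a T/T_max bonus for the proposer, 1/T for the navigator) is a guess at what "proportional" and "inverse of that length term" could mean, not a settled choice.

```python
# hypothetical reward shaping for the proposer / navigator split described above
def proposer_reward(reached: bool, traj_len: int, max_len: int) -> float:
    # +1 only if the proposed state was actually reachable,
    # plus a bonus for states that took the navigator longer (reachable but hard)
    return 1.0 + traj_len / max_len if reached else 0.0

def navigator_reward(reached: bool, traj_len: int) -> float:
    # reward for reaching at all, scaled so shorter trajectories pay more
    return 1.0 / traj_len if reached else 0.0
```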
i'm still trying to develop this idea outside of code as much as i can before going into the code (which could be the wrong approach but whatever..) like, do we just try for novelty? i think it'd have to be self-play: proposing a state to reach and then reaching it
December 20, 2025 at 6:59 PM
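and the self-play itself is basically just this skeleton: the proposer picks a target, the navigator tries to reach it, and both get scored as in the reward sketch above. proposer, navigator, and env are placeholders (something shaped like the ProjectedEnv sketch further up, plus a set_target), not code that exists.

```python
# bare self-play episode: propose a state, try to reach it, score both agents
def self_play_episode(proposer, navigator, env, max_len=200):
    target = proposer.propose(env.observe())          # proposer picks a goal state
    env.set_target(target)
    for t in range(1, max_len + 1):
        obs, reward, reached = env.step(navigator.act(env.observe()))
        if reached:
            # proposer likes reachable-but-slow targets, navigator likes fast ones
            return 1.0 + t / max_len, 1.0 / t
    return 0.0, 0.0                                   # never reached: nobody gets paid
```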
discriminator's output gets assigned positive reward when it correctly determines the origin of a text (generated / real)

generator's output gets assigned positive reward when it fools the discriminator

increase the likelihood of high-reward sequences, decrease the likelihood of low-reward sequences
December 20, 2025 at 6:58 PM
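a minimal sketch of that reward assignment as a REINFORCE-style update, assuming torch; generator.sample() returning tokens plus per-token log-probs is a hypothetical interface, and a real text-GAN setup would need baselines and far more variance reduction than this.

```python
# REINFORCE-flavoured text GAN step: the discriminator is rewarded for correct real/generated
# calls, the generator is pushed toward sequences the discriminator scores as "real"
import torch
import torch.nn.functional as F

def gan_rl_step(generator, discriminator, real_tokens, g_opt, d_opt):
    # 1. sample a sequence from the generator, keeping per-token log-probs (hypothetical API)
    fake_tokens, log_probs = generator.sample()

    # 2. discriminator: plain binary classification, real = 1, generated = 0
    d_real = discriminator(real_tokens)
    d_fake = discriminator(fake_tokens)
    d_loss = (F.binary_cross_entropy_with_logits(d_real, torch.ones_like(d_real))
              + F.binary_cross_entropy_with_logits(d_fake, torch.zeros_like(d_fake)))
    d_opt.zero_grad(); d_loss.backward(); d_opt.step()

    # 3. generator: reward = centered probability the discriminator assigns "real" to the fake,
    #    i.e. increase likelihood of high-reward sequences, decrease low-reward ones
    with torch.no_grad():
        reward = torch.sigmoid(discriminator(fake_tokens)) - 0.5
    g_loss = -(reward * log_probs.sum(dim=-1)).mean()
    g_opt.zero_grad(); g_loss.backward(); g_opt.step()
    return d_loss.item(), g_loss.item()
```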
text gan with reasoning
December 20, 2025 at 6:52 PM
for obvious reasons i am not making this dataset public, but it is incredibly easy to recreate. yikes! or cool! whatever you think is the right thing for me to say here
December 16, 2025 at 6:42 AM
if you keep a tight bound on the distance from the original embedding, the edits look like changes in specific word choice or quirks of grammar
December 16, 2025 at 6:42 AM
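a sketch of that "tight bound on the distance" as a literal constraint, assuming plain numpy vectors; any edited embedding just gets projected back onto a small ball around the original.

```python
# keep an edited embedding within radius eps of the original;
# a small eps is what makes the edits look like word-choice / grammar quirks
import numpy as np

def clamp_to_ball(original, edited, eps=0.1):
    delta = edited - original
    norm = np.linalg.norm(delta)
    if norm <= eps:
        return edited
    return original + delta * (eps / norm)   # project back onto the eps-sphere

# toy usage with made-up 512-d embeddings
z0 = np.random.default_rng(0).normal(size=512)
z1 = clamp_to_ball(z0, z0 + 0.5 * np.random.default_rng(1).normal(size=512))
```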
a model trained on this set, along with my text autoencoder "essence", can be used to create a system that autonomously edits a post to maximize the likelihood of a reply from a user and to tune specific qualities of their likely reply
December 16, 2025 at 6:42 AM
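not the actual system, just a sketch of its shape: gradient ascent on the post's latent under a reply-likelihood model, with the distance clamp from the sketch above so the edits stay subtle. the essence encode/decode calls and the reply_model signature are assumptions, not real interfaces.

```python
# hypothetical: hill-climb a post's latent to raise predicted reply likelihood,
# staying within eps of the original embedding so the post is only lightly edited
import torch

def optimize_post(essence, reply_model, post_text, user_ctx, eps=0.1, steps=50, lr=0.05):
    z0 = essence.encode(post_text).detach()      # assumed encoder interface
    z = z0.clone().requires_grad_(True)
    opt = torch.optim.Adam([z], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss = -reply_model(z, user_ctx)         # p(reply | edited post, user's recent activity)
        loss.backward()
        opt.step()
        with torch.no_grad():                    # clamp back into the eps-ball around z0
            delta = z - z0
            norm = delta.norm()
            if norm > eps:
                z.copy_(z0 + delta * (eps / norm))
    return essence.decode(z.detach())            # assumed decoder interface
```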
we did some scraping and data engineering and now have 2 billion tokens to use to model posts that users reply to, and their replies to those posts, given their recent activity
December 16, 2025 at 6:42 AM