Lukas Galke
lukasgalke.bsky.social
Assistant Professor @SDU tracing connectionist mechanisms.

https://lgalke.github.io
When analyzing the learning trajectories of RNNs throughout training, we make several other interesting observations: medium-structured languages have a learnability advantage early in training (likely because the same word is used for multiple meanings) but fall behind highly structured languages later.
December 30, 2024 at 6:34 PM
We find a similar effect when looking at memorization errors. In the memorization test, the task for in-context LLMs boils down to copying a word that appears earlier in the prompt. But even here, we see an advantage for language structure.
December 30, 2024 at 6:34 PM
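A minimal sketch of what such an in-context memorization test could look like (the `meaning -> word` prompt format and the lexicon entries here are hypothetical illustrations, not the exact setup from the paper): the prompt lists meaning/word pairs, then queries a meaning already shown, so a correct answer is a pure copy.

```python
# Hypothetical lexicon: meanings mapped to made-up words (illustrative only).
lexicon = {"circle-blue": "fuwo", "square-blue": "kexi", "circle-red": "fune"}

def build_memorization_prompt(lexicon, query_meaning):
    """List all meaning -> word pairs, then query one seen meaning."""
    lines = [f"{m} -> {w}" for m, w in lexicon.items()]
    lines.append(f"{query_meaning} ->")
    return "\n".join(lines)

def copy_answer(prompt):
    """Trivial baseline: answer by copying the word paired with the
    queried meaning earlier in the prompt -- the behavior being probed."""
    *shown, query = prompt.splitlines()
    meaning = query[: -len(" ->")]
    for line in shown:
        m, w = line.split(" -> ")
        if m == meaning:
            return w
    return None
```

Because the answer is literally present in the context, any errors an LLM makes here reflect failures of in-context retrieval rather than of generalization.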
All these learning systems, small RNNs, pre-trained LLMs, and humans, show *very* similar memorization and generalization behavior -- with more structured languages leading to generalizations that are more systematic and more similar to those of human participants.
December 30, 2024 at 6:34 PM