Benjamin Gagl
@benjamingagl.bsky.social
Assistant Professor for Self Learning Systems @UniCologne
#Reading #NeuroCognition #ComputationalModels

https://selflearningsystems.uni-koeln.de/
Fun project, with much more nuanced experiments and discussion of the effects.
November 13, 2025 at 2:02 PM
Interestingly, across the ten corpora we derived in the study, we found a correlation showing that corpora with lower richness generally yield more adequate word frequency measures.
November 13, 2025 at 2:02 PM
However, when the derived frequencies are used to estimate the word frequency effect in the model of children's word recognition behavior, the fits indicate that frequency measures based on the LLM corpus capture the effect more adequately than those based on the children's book corpus.
November 13, 2025 at 2:02 PM
Investigating the characteristics of the text that LLMs generate when asked to write for children.

We focus on lexical richness and word frequency, showing that the LLM text is generally less rich than children's books.
November 13, 2025 at 2:02 PM
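
For readers unfamiliar with the two measures named in the thread, here is a minimal sketch of how lexical richness and word frequency could be computed from a corpus. It is an illustration under stated assumptions, not the study's method: the posts do not say which richness measure was used, so the type-token ratio stands in as one common choice, and the tokenizer, function names, and toy sentence are all hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    # Assumption: lowercase word tokens split on non-word characters.
    return re.findall(r"\w+", text.lower())


def type_token_ratio(tokens):
    # One simple richness measure: distinct words divided by total words.
    # Assumption: the study may well use a different measure.
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def frequency_per_million(tokens):
    # Word frequency normalized to occurrences per million tokens.
    total = len(tokens)
    counts = Counter(tokens)
    return {word: count * 1_000_000 / total for word, count in counts.items()}


# Toy example text (hypothetical, not data from the study).
toy_tokens = tokenize("The cat sat on the mat. The end.")
toy_ttr = type_token_ratio(toy_tokens)
toy_freq = frequency_per_million(toy_tokens)
```

Note that the type-token ratio shrinks as texts get longer, so corpora of different sizes are only comparable after sampling them to equal length or using a length-corrected measure.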