cedric
@ccolas.bsky.social
Building autotelic agents from socio-cultural interactions
https://ccolas.github.io/
induction vs transduction point holds

with induction you can search because you have a metric to optimise (% train examples correct)

with transduction there is no clear metric to guide search / brute force, so the model needs to get it right? or to come up with a way to guide its own search
December 22, 2024 at 12:19 AM
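to make the "metric to optimise" point concrete, here is a minimal sketch of search guided by % of train examples correct. the three-primitive grid DSL, `train_accuracy` and `greedy_search` are all hypothetical names made up for illustration, not anything from an actual arc submission.

```python
from typing import Callable, Dict, List, Tuple

Grid = List[List[int]]
Example = Tuple[Grid, Grid]  # (input grid, expected output grid)


def flip_h(g: Grid) -> Grid:
    """Mirror each row left to right."""
    return [row[::-1] for row in g]


def flip_v(g: Grid) -> Grid:
    """Reverse the order of the rows."""
    return g[::-1]


def transpose(g: Grid) -> Grid:
    """Swap rows and columns."""
    return [list(col) for col in zip(*g)]


# Hypothetical toy DSL: a program is a list of primitive names applied in order.
PRIMITIVES: Dict[str, Callable[[Grid], Grid]] = {
    "flip_h": flip_h,
    "flip_v": flip_v,
    "transpose": transpose,
}


def run(program: List[str], g: Grid) -> Grid:
    for name in program:
        g = PRIMITIVES[name](g)
    return g


def train_accuracy(program: List[str], train: List[Example]) -> float:
    """The search metric: fraction of training pairs reproduced exactly."""
    return sum(run(program, x) == y for x, y in train) / len(train)


def greedy_search(train: List[Example], max_len: int = 3) -> Tuple[List[str], float]:
    """Greedy hill climbing: append the primitive that most improves the metric."""
    program: List[str] = []
    best = train_accuracy(program, train)
    for _ in range(max_len):
        candidates = [program + [name] for name in PRIMITIVES]
        top = max(candidates, key=lambda p: train_accuracy(p, train))
        score = train_accuracy(top, train)
        if score <= best:  # no strict improvement, stop climbing
            break
        program, best = top, score
    return program, best
```

with transduction there is no `train_accuracy` to call on a candidate output grid, since the test output is exactly what is unknown.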
just checked, on semi-private set ryan got 43 (not that far i admit)
December 22, 2024 at 12:15 AM
ok he did use for loops so he didn't hill climb, but you can 'filter good candidates' by taking the solutions that solve 100% of the training examples, and submit only these as solutions

with transduction you can't filter

the challenge rules say you can submit only 2 (3?) solutions per pb
December 22, 2024 at 12:09 AM
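a rough sketch of the filtering step described above, assuming candidates are plain python callables from grid to grid. `solves_all` and `select_submissions` are hypothetical helper names, and the default cap of 2 submissions is an assumption (the post itself is unsure whether the limit is 2 or 3).

```python
from typing import Callable, List, Tuple

Grid = List[List[int]]
Example = Tuple[Grid, Grid]  # (input grid, expected output grid)
Program = Callable[[Grid], Grid]


def solves_all(program: Program, train: List[Example]) -> bool:
    """True only if the program reproduces every training output exactly."""
    for x, y in train:
        try:
            if program(x) != y:
                return False
        except Exception:
            # A crashing candidate counts as a failure.
            return False
    return True


def select_submissions(
    candidates: List[Program],
    train: List[Example],
    test_input: Grid,
    max_submissions: int = 2,  # assumed cap, see the lead-in
) -> List[Grid]:
    """Keep candidates that solve 100% of training pairs, then submit
    their distinct test outputs, up to the submission cap."""
    outputs: List[Grid] = []
    for program in candidates:
        if not solves_all(program, train):
            continue
        try:
            out = program(test_input)
        except Exception:
            continue
        if out not in outputs:  # skip duplicate predictions
            outputs.append(out)
        if len(outputs) == max_submissions:
            break
    return outputs
```

no for-loop cleverness is needed here: the training pairs act as a verifier, which is the thing a direct output prediction does not have.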
2) he used program synthesis which allows hill climbing on the % of training examples correct

if the o3 prompt that circulates is correct, the o3 score uses transduction (predicting output grid directly), and you can't hill climb there

you can ensemble, but that doesn't help much for hard pbs
December 22, 2024 at 12:03 AM
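for the ensembling option mentioned above, one common way to do it is a majority vote over sampled output grids. this is only a sketch under that assumption: `top_k_by_vote` is a hypothetical helper, and nothing here is claimed to be how o3 or any arc entry actually aggregates its predictions.

```python
from collections import Counter
from typing import List, Tuple

Grid = List[List[int]]


def to_key(g: Grid) -> Tuple[Tuple[int, ...], ...]:
    """Grids are lists of lists, so convert to nested tuples to make them hashable."""
    return tuple(tuple(row) for row in g)


def top_k_by_vote(predictions: List[Grid], k: int = 2) -> List[Grid]:
    """Return the k most frequently predicted grids, most common first."""
    counts = Counter(to_key(g) for g in predictions)
    return [[list(row) for row in key] for key, _ in counts.most_common(k)]
```

voting only surfaces the right grid if it is already among the most frequent samples. on a hard problem the correct output may appear rarely or never, and there is no training-set check to rescue it.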
this is a misleading comparison for two reasons

1) that guy got 50% on the public test set, which is easier than the private test set where o3 reached 85% (87?)
December 22, 2024 at 12:03 AM
the official testing procedure is 2 or 3 solutions per problem iirc, don't think chollet would have let them brute force it

it seems they don't use program induction, so they can't hill climb on training examples either
December 21, 2024 at 3:57 PM
hope to see you all at the IMOL workshop on sunday!
December 9, 2024 at 7:47 PM
hi Melanie,
we have a cool workshop on intrinsically motivated open-ended learning with a blend of cogsci and ai on dec 15

@IMOLNeurIPS2024 on X

see program here: imol-workshop.github.io/pages/program/
November 29, 2024 at 7:27 PM
oh cool, what's the paper? i've been thinking it could be the case and was wondering who wrote about it
November 28, 2024 at 1:04 AM
find more info at flowers.inria.fr/jobs/ (other positions are open)
November 27, 2024 at 5:43 PM
balancing exploration and exploitation with autotelic rl

autotelic rl is usually concerned with open-ended exploration in the absence of external reward

how should we conduct an open-ended exploration *at the service* of an external task?

deep rl skills required
November 27, 2024 at 5:43 PM
llm-mediated cultural evolution

we wanna study how llm-based agents can be used to facilitate collective intelligence in controlled human experiments where groups of participants collectively find solutions to problems

this requires some background in cogsci + llms
November 27, 2024 at 5:43 PM