by John Gardner
by John Gardner — Reposted by John Gardner
“Distillation of atomistic foundation models across architectures and chemical domains”
Deep dive thread below! 🤿🧵
by John Gardner
by John Gardner
by John Gardner
by John Gardner
github.com/jla-gardner/...
by John Gardner
by John Gardner
by John Gardner
by John Gardner
by John Gardner
by John Gardner
Note that these student models are of a different architecture to MACE, and in fact ACE is not even NN-based.
by John Gardner
by John Gardner
by John Gardner
by John Gardner
by John Gardner
by John Gardner
by John Gardner
by John Gardner
by John Gardner
by John Gardner
by John Gardner
@ask1729.bsky.social
and others extract additional Hessian information from the teacher. Again, this works well providing you have a training framework that lets you train student models on this data.
by John Gardner
and others attempt to align not only the predictions, but also the internal representations of the teacher and the student. This approach works well for models with similar architectures, but is incompatible with e.g. fast linear models like ACE.
by John Gardner
Various existing methods in the literature do this in different ways.
by John Gardner
This lets you explore new science, and democratises access to otherwise expensive simulations/methods and foundation models. 💪
by John Gardner
If this can be done well, it is an extremely useful thing!
by John Gardner
by John Gardner
jla-gardner.github.io/graph-pes/
by John Gardner
Please also reach out via GitHub issues or DM on here if you have any questions or feedback.
github.com/jla-gardner/...