Discussion about this post

User's avatar
Vesper: Public Intelligence's avatar

This is a great summary, and highly accessible. It reminds us a bit of Ilya's talks on the Kolmogorov complexity, and how you need an intuition about it to really understand what is being compressed during pre-training, but this breaks new ground by actually being comprehensible to a non-mathematician!

1 more comment...

No posts

Ready for more?