DE version is available. Content is displayed in original English for accuracy.
Advertisement
Advertisement
⚡ Community Insights
Discussion Sentiment
50% Positive
Analyzed from 111 words in the discussion.
Trending Topics
#information#more#anyone#models#bits#wonder#figured#compressed#calculated#amount

Discussion (5 Comments)Read Original on HackerNews
I wonder if anyone has figured out how the information is compressed and calculated the amount of information an LLM can hold depending on its size
You might want to look at Physics of Language Models[1]. IIRC, the authors estimate it to be ~2 bits of factual knowledge per parameter.
[1]: https://physics.allen-zhu.com/