HI version is available. Content is displayed in original English for accuracy.
Advertisement
Advertisement
⚡ Community Insights
Discussion Sentiment
100% Positive
Analyzed from 225 words in the discussion.
Trending Topics
#entropy#context#loss#right#next#cross#length#capabilities#scaling#laws

Discussion (8 Comments)Read Original on HackerNews
Also, linear gains in context length scale quadratically with compute because of attention, so depending on context growth means taking a bath on GPUs for as long as you can, right?