GLM-5.2 is probably the most powerful text-only open weights LLM19BBrajeshwar about 4 hours ago 1 commentsRead Article on simonwillison.net DE version is available. Content is displayed in original English for accuracy.
Discussion (1 Comments)Read Original on HackerNews
If we didn’t have the previous example I would interpret this as pretty solid evidence that labs were training on the Pelican “benchmark”.
I just can’t imagine a model dropping so significantly from one version to the next on such a silly task.