> The model, called VibeThinker-3B, scored 94.3 on AIME 2026, the American Invitational Mathematics Examination, one of the most demanding standardized math competitions in the world. That figure places it alongside DeepSeek V3.2, a model with 671 billion parameters
Overfitting, no need to argue about anything I think?
The rest of the article seems to be echoing people's misunderstanding of pretty elementary stuff.