FR version is available. Content is displayed in original English for accuracy.
Advertisement
Advertisement
⚡ Community Insights
Discussion Sentiment
100% Positive
Analyzed from 486 words in the discussion.
Trending Topics
#papers#paper#paperswithcode#https#used#sota#domain#models#great#find

Discussion (17 Comments)Read Original on HackerNews
Niels here from the open-source team at Hugging Face. Like many others, I was a huge fan of paperswithcode, a website which allowed to easily find the state-of-the-art (SOTA) across any domain of AI, from computer vision to language models to time-series forecasting. Sadly, that website is no longer maintained after its acquisition by Meta.
Hence, I've been working on reviving it. I obviously use AI agents to parse papers at scale and automatically generate leaderboards (for now I'm the one verifying results). So far, I've only parsed high-impact papers for which I know they're SOTA, like Qwen 3.5 and 3.6, RF-DETR for object detection, DINOv3, SOTA embedding models from the MTEB leaderboard, the Open ASR Leaderboard for automatic speech recognition models, etc.
For now, it includes the following:
> trending papers by default based on Github star velocity
> categorization by domain, e.g., [OCR](https://paperswithcode.co/tasks/ocr)
> methods, popular techniques used across AI papers, which PwC used to have as well, like [RLVR](https://paperswithcode.co/methods/rlvr) and
> eval results for high-impact papers, see e.g., Qwen 3.5 at the bottom
> leaderboards for each domain, e.g., MMTEB or COCO val 2017
> conferences, like [CVPR 2026](https://paperswithcode.co/conferences/cvpr-2026)
> support for citation counts (you can also see the most cited papers by domain!)
> automated linked Github, project page URLs, and artifacts (+ multiple repos are supported on a paper page)
> support for external papers beyond Arxiv, see e.g., [DeepSeek v4](https://paperswithcode.co/paper/82956)
> Harness reports for coding agent benchmarks, e.g., Terminal Bench
> "Sign in with HF" and Storage Buckets are used to store humbnails, paper PDFs, and overall data backups.
I'm curious about your feedback + feature requests!
Try it at https://paperswithcode.co
It would be lovely to parse which datasets/benchmarks were used in the comparisons and select papers by dataset!
In many fields the datasets vary greatly depending on the subfield and its very difficult to find what other benchmarks could be used.
One feature I would love is to get notified via email when new papers are added (or periodically, once a week/daily).