Back to News
Advertisement
ddatafreak_ about 4 hours ago 7 commentsRead Article on stackscope.dev

ZH version is available. Content is displayed in original English for accuracy.

Hey all, I built StackScope, a crawler/catalogue that looks at new product launches and shows what they were built with.

It watches launches from Product Hunt, Show HN, and PeerPush, then crawls the public site behind each one. The goal is to show what people actually launched with: hosting, frameworks, analytics, DNS, security headers, legal pages, AI-builder signals, and other public clues.

I started building it because most stack-detection sites look at the web as a whole. I was more interested in the current indie launch scene: what people are choosing right now, at the point they first put something in public.

A few implementation details: it runs on .NET, uses Playwright for rendered pages, and has a first-party fingerprint catalogue rather than one copied from Wappalyzer/etc. robots.txt is honoured, and the bot identifies itself.

Frustratingly, I am still waiting for verified bot status from Cloudflare and currently that knocks out about 10% of all sites.

There is also a private readiness check: paste a URL, get the same style of report, fix things, and recrawl. No account or email needed.

I'd be interested in feedback on the usefulness of this, the methodology, and any obvious false positives.

Jonathan.

Advertisement

⚡ Community Insights

Discussion Sentiment

80% Positive

Analyzed from 189 words in the discussion.

Trending Topics

#css#stackscope#https#dev#cloudflare#txt#nice#looks#interesting#didn

Discussion (7 Comments)Read Original on HackerNews

pixel_popping37 minutes ago
datafreak_31 minutes ago
What's your response header content? Reads like it's being cf-mitigated.

Edit: loosened a cf rule, should be good now.

Gabriel_Valente20 minutes ago
Nice one, looks interesting. I didn't knew that Vercel was so far ahead on hosting. People don't usually use Cloudflare Pages for hosting? Or it doesn't enter in that metric? https://stackscope.dev/trends/2026/06

Overall this looks solid

datafreak_5 minutes ago
Cloudflare Pages get hidden behind the CF CDN, so are included in the CDN figure.
addedlovelyabout 2 hours ago
Nice one.

I've been doing the same bit wider scope, for the whole Crux list, pruned to apex domains, and looking for CMS signals - how's your throughput?

I'm not doing any headless browser stuff, or many requests, so hyper optimised for speed.

I do grab robots.txt - didn't really see much in llms.txt or humans.txt in the wild, does yours?

addedlovelyabout 2 hours ago
Ohh Cloudflare verified bot status, interesting I'll check that out.

I'm seeing about 6.6% block rate, but that does climb over time.

ididabout 2 hours ago
One extension, beyond stack: market category/domain/application - or any combo that tells me what the product does.

Fab project otherwise!