ZH version is available. Content is displayed in original English for accuracy.
Advertisement
Advertisement
⚡ Community Insights
Discussion Sentiment
67% Positive
Analyzed from 292 words in the discussion.
Trending Topics
#flops#order#https#python#perform#single#programing#gpu#statement#core

Discussion (12 Comments)Read Original on HackerNews
wild
"The I7-4770K and preform 20k more Flops than C++" is an equally sensible statement (i.e. not)
The rest will be from "python float" (e.g. not from numpy) to C, which gives you already 2 to 3 order of magnitude difference, and then another 2 to 3 from plan C to optimized SIMD.
See e.g. https://github.com/Avafly/optimize-gemm for how you can get 2 to 3 order of magnitude just from C.
https://arxiv.org/abs/1912.02292