Back to News
Advertisement
Advertisement

⚡ Community Insights

Discussion Sentiment

100% Positive

Analyzed from 255 words in the discussion.

Trending Topics

#interesting#llm#read#readme#author#cuda#approach#reading#assembly#through

Discussion (16 Comments)Read Original on HackerNews

yu3zhou4about 23 hours ago
README is in my opinion (author here) the most interesting - I wrote it to help others build useful mental model to be able to recreate the project yourself, without need to even read my code
lukemerrickabout 4 hours ago
I am not super familiar with C and CUDA, so I read solely for the README and enjoyed it supremely. The blend of cheerful walking through instructive examples and your philosophical takes on how to approach the exercise to get the most out of it put me in a great mood. You captured that special upbeat attitude that comes about when you're doing something as well as you can just because it's so legitimately interesting to you.
janalsncmabout 18 hours ago
Really practical teaching approach. I clicked in to see how safetensors are loaded and just kept reading. Thanks for sharing.
tom-walabout 12 hours ago
I feel like I learned twice as much in 10 minutes reading this than I did reading LLM for Dummies. Thank you
xuanlin314about 17 hours ago
The lesson-style README is a great approach. Breaking down LLM inference into digestible steps makes the codebase approachable even for people who haven't touched CUDA before.
dwa3592about 21 hours ago
Very nice job on read me.

>>Physically, LLM is a file which contains a lot of float numbers.

aka atoms of the LLM.

cyanydeezabout 21 hours ago
the universe is just atomic if statments
nullpoint420about 12 hours ago
it from bit
GoldenJadeabout 17 hours ago
Thanks for sharing this. As someone currently researching LLMs, I'm sure I'll be referencing this quite a bit going forward.
juancnabout 22 hours ago
Looks interesting, it reminds me of the first llama.cpp, but better documented.
nazgulsenpaiabout 23 hours ago
I love the documentation formatted in lessons. I can't wait to read through it.
cookiengineerabout 21 hours ago
Wanted to add that the author has an amazing blog with lots of interesting papers: https://jedrzej.maczan.pl/
smy_smyabout 3 hours ago
interesting!
sylwareabout 10 hours ago
I am looking at a plain and simple C implemented LLM inference, and/or x86_64 assembly implemented, and/or AMD GPU RDNA assembly.

Anybody?

irishcoffeeabout 6 hours ago
I heard once that c++ can become assembly at some point if you type the right things in. :)
einpoklumabout 21 hours ago
It seems the author believes checking the return values of CUDA API calls is not "tiny" enough :-(
Advertisement