Discussion (14 Comments, from Hacker News)
See https://arxiv.org/abs/2505.19056
https://github.com/p-e-w/heretic
For some of the latest models, earlier abliteration techniques, e.g. the heretic tool, have stopped working (at least, that was the status a few weeks ago).
Of course, someone might eventually succeed in finding methods that work with those models too.
Makes you wonder where that data was taken from, or if their Great Firewall is broken, or even if Alibaba engineers have special access...
What is perhaps more surprising is that the data was not scrubbed before training, but maybe they thought that would be too on-the-nose for the rest of the world and would hamper their popularity if they were too obviously biased.
It even went as far as confirming that we should always base our opinion on multiple sources, not just the government.
We should create badges like "script kiddie", "llm hacker", "grandpa's printer adjuster"