ES version is available. Content is displayed in original English for accuracy.
Advertisement
Advertisement
⚡ Community Insights
Discussion Sentiment
50% Positive
Analyzed from 560 words in the discussion.
Trending Topics
#reddit#zenodo#training#model#something#don#why#https#pretty#thats

Discussion (16 Comments)Read Original on HackerNews
I've definitely experienced this. Before I learned to watch for it, I spent around an hour correcting Claude about something or other repeatedly. It kept agreeing and explaining to me that it understood what mistakes it made and telling me that it would do better, then it would repeat said mistakes. Eventually, I realized that it was in a loop and couldn't escape. I had it write a handoff doc for the next agent. That one quickly did what I wanted. Such a waste of time.
I don't know how prone LLMs are to entering such a state. I know to watch for it now, so I've only reached the edges of it before ejecting and starting over. But it appears to be not-uncommon. I could also be pattern-matching things that aren't actually that but bailing without proof to save myself time. Unclear.
I did not care for the "X article" (is that what it's called?), but I don't get the rage that is in that reddit thread.
Since we are in the golden age of grifting, this guy will probably go pretty far.
I dunno, I think thats pretty convincing (http://voicefirst.expert/about/)
https://zenodo.org/records/17720178
Note that Zenodo is a DOI-provider, not a (scientific) journal. Anyone can upload anything to Zenodo. It's less strict than arXiv.
Edit: The "paper" is written by one Hiroko Konishi, an independent researcher (she is a voice actress).
Of course they reflect the bias in the training, thats been known since the 90s if not longer (see apocryphal story about training to detect tanks, but only detecting either trees or clouds)
but like this is expected, the whole point of RLHF (or any other feedback) is to condition the model to respond in a certain way. Thats what makes them useable for a bunch of situations.
You feed it reddit and wikipeidia it's gonna turn into a conformist npc.
You feed it the contents of professional content and it's gonna spew vapid corporate nothingness.
You feed every text message ever sent over Boost Mobile, actually wait that sounds hilarious someone should do that.