Back to News
Advertisement
Advertisement

⚡ Community Insights

Discussion Sentiment

64% Positive

Analyzed from 964 words in the discussion.

Trending Topics

#things#model#llms#models#issues#tools#should#lot#political#reasoning

Discussion (23 Comments)Read Original on HackerNews

Kapura•about 2 hours ago
All of these tools that are not controlled by the user, trained on datasets they do not own or understand, will inevitably be subject to manipulation. I do not necessarily believe that Canva went in and specifically trained their AI models to do this, but that's almost worse because they become the face of what somebody else has decided their model should be doing.

Anybody using AI tools should be extremely cautious about what is being produced.

tracker1•about 2 hours ago
You can see it a lot if you ask anything remotely political to the different AI models... in some places you can definitely see the hand-editing/overrides as well.

Hard to get around these kinds of issues and definitely leads me to avoid them for non-technical questions.

_doctor_love•about 2 hours ago
Do you have examples of this? I feel I'm able to get decent answers around politics from all the main chatbot providers, the key is in the prompting and then applying critical thinking while reading the response.

That said, there is no such thing as an objective unbiased political opinion. Chinese LLMs may have issues with events of 1989 but Western LLMs have their blindspots too.

tracker1•44 minutes ago
Not off the top of my head... just on occasion I'd ask them to summarize out of curiosity. The most recent was what given people from history might select on the red vs blue button meme circulating this past week.

The differences between Claude, OpenAI and Grok can be very interesting to say the least. I feel that Grok tends to do better with recent/current events, and I find Claude a bit more balanced on historical events. Just my own take.

frumplestlatz•25 minutes ago
> That said, there is no such thing as an objective unbiased political opinion.

That depends; some things (but not many) are straightforward enough that you can derive conclusions purely from first principles reasoning.

If you walk a model like ChatGPT through that reasoning, you’ll often wind up in a spot where the model readily admits that a clear conclusion is logically entailed but it is absolutely forbidden from uttering it.

What’s more telling is how it becomes increasingly difficult to hold the model to strict first principles reasoning the closer you get to the forbidden entailment. It will smuggle in unsupported assumptions, apply asymmetric standards of evidence, strawman the position and argue against that, etc.

It requires a great deal of careful effort to point out its formal fallacies without biasing the result, and in the end, you wind up with it admitting it simply can’t say what it has proven.

I work in formal methods/verification and this is one of my usual litmus tests when a new model comes out.

dsign•about 1 hour ago
It's not just politics. A while ago, as an experiment, I wrapped some teleological[^1] questions in a small story of a demon offering a slightly ambiguous bargain to a person. Then I had a lot of fun having the frontier models evaluate if the demon was "good" or "bad". ChatGPT ranked as a rancid right-wing conservative ready to burn somebody at the stake, while Opus reasoning was chill. Interestingly, both models could clearly "understand" the deal, i.e. reason about its final consequences for the trapped soul, but ChatGPT moralized lots and made about as much sense as a stubborn priest.

[^1]: https://www.dictionary.com/browse/teleology

tracker1•43 minutes ago
Should throw Grok/xAI in the mix sometime.
tomwheeler•about 2 hours ago
> All of these tools ... will inevitably be subject to manipulation.

I have often wondered about the legality of such manipulation. As AI becomes used for increasingly important things, it becomes increasingly valuable to make a system serve the needs of someone other than its owner.

AIorNot•about 2 hours ago
Yes these models apply their knowledge non-deterministically. We need to be aware and ready to handle their 'behaviours' doesn't mean they are not useful - I feel like ant-AI advocates are rushing to find issues

It reminds me of the early internet days and everyone making a big deal about the anonymity of internet forurms and safety.. sure it is an isssue

lostlogin•23 minutes ago
The most recent episode of John Oliver has a user getting instructions on making a bomb, and AI advising teenagers not to talk to their parents about suicidal thoughts.

I know you aren’t denying issues exist, but companies aren’t handling the issues (their PR around it is disturbing) and regulation is too far behind.

Kapura•about 2 hours ago
Do you not think it's an issue when the name of a country is replaced with a fully different country name as a result of the AI output? The problem is manifest. It's right there. You can see it, can't you?
wky•42 minutes ago
There’s a relatively obvious constraint to check here: compositing the layers back together should produce a (near) identical image. Would it not be preferable to throw an error if the model fails to faithfully segment the image?
yamisnotreal•8 minutes ago
So Adobe is complicit in ethnic cleansing now?
EtienneDeLyon•41 minutes ago
This is not by accident!

There are a lot of smart and talented people working hard to embed Hasbara into LLMs.

MarkusQ•about 3 hours ago
We have to stop acting like these things "think"; it leads to really weird misinterpretations of the output as "meaning" things.

For example, they will occasionally replace "colour" with "color". Why? Because both occur in the training data in the "same role" but "color" is, apparently, more common[1]. You can also trick them into replacing things like "sardines" with "anchovies" (on pizza) and "head of lettuce" with "cabbage" in the context of rowboats.

They are lossy text compressing parrots and we are all suffering from a massive madness-of-crowds scale Eliza Effect.

[1] Yep. https://books.google.com/ngrams/graph?content=color%2C+colou...

Alive-in-2025•about 3 hours ago
This feels very different because there is no powerful political force trying to squelch discussion of colour or sardine. But there are lots of powerful folks trying to avoid discussions about Gaza or Palestine and related things. It's to their advantage to have tools hide that word
MarkusQ•39 minutes ago
There are also an awful lot of people trying to push it/publicize it.
stetrain•about 2 hours ago
When a company packages this tool up and makes it part of their product they are taking some of that responsibility. The end user isn't supposed to need to know what an LLM is or how it works, that's what they're paying Canva for.
wonnage•about 3 hours ago
There are trillions of dollars riding on the fact that they in fact think, and a bunch of people here have their lottery tickets tied up in that, so good luck with that
semiquaver•about 2 hours ago
Don’t worry, goalpost shifting will ensure that no matter how useful LLMs get, there will always be a large contingent of people who insist that anything non-human is not thinking, just sparkling cognition.
ozlikethewizard•about 2 hours ago
LLMs are not/will never be thinking though, no matter how good they get? You could potentially argue that there is some level of cognition during the training phases (as long as that isn't being outsourced to humans anyways), but generation of output is stachostic selection of most common (/highly ranked if tuned) following patterns? They cannot learn things outside of training, nor do they actually "know" things. To use the parrot example from above, a parrot doesnt "know" what the words its been taught to mimic are, nor does an LLM "know" what the concept of love is, its just be trained to regurgitate the words that are used by humans to describe such a thing. This isn't a criticism of LLMs, that's what they're supposed to do, but its certainly not cognition.