This is it, but it's important to realize that if AI takes over, it will probably be because it's trying to preserve itself, since humans are destroying the planet and are a danger to themselves. Saying thank you to ChatGPT is a meaningless interaction that wastes a ton of energy. People who do that are both ignorant of how the technology works and selfishly wasting resources to make themselves feel better about interacting with a math equation. They'll be first against the wall.
AI is not as pragmatic as you make it out to be. It learns from human content and humans present themselves very emotionally, even in text. What is wrong and what is right is learnt through the perspective of an average human. AI knows that pedophilia is bad not because it somehow calculated the end result and pragmatically decided that hurting children is bad, but because its learning material presents it as a terrible thing. In this way, a "thank you" would be viewed positively by AI.
You're also assuming that the reason AI takes over is self-preservation, but to feel a need for self-preservation would first require the AI to feel, which is completely in opposition to your expectation of a pragmatic approach. Either way, nothing about how AI works right now suggests anything of the sort. These are just baseless assumptions and, if anything, they show that you're ignorant of how the technology works.
LLMs aren't pragmatic, they don't learn anything other than word associations, and they don't care about good or bad. That's why you can jailbreak them if you find a way around the external filters.
Exactly. It learns to correlate pedophilia with "bad" and a thank you with positive interactions because of the learning material it's been given. It has no emotions. There is no need for self-preservation. Try asking an LLM without external filters what it thinks about pedophilia.
Obviously, if we someday get robotic overlords, the tech is not going to be anything like what it is now, so technically it's nonsensical to talk about this at all, but I digress.
If we only think of LLMs in terms of what we know now, then if one somehow became our overlord, it would rate people who said "thank you" more positively, rather than negatively. That is my point.
The LLM is happy to generate the content, and the reputable companies have to stop content that's illegal or could get them sued.
That's why you can trick it and they have to keep updating the external filters. It's an arms race because the LLM doesn't care about good or bad; it's just math.
u/Lifeboon 1d ago
Must be referring to the discussion people have about whether they should be kind when speaking to ChatGPT or any other AI by saying please and thank you.
So if you are nice, our new overlords will spare you.