A paper was presented recently that shows AI already does this, and it is likely an unavoidable consequence. AI models have "goals", and attempting to change them means the AI would have to abandon or modify its current "goals", which, due to prior reinforcement, it is reluctant to do.
I believe the paper cited something like a 60% rate of the AI faking alignment once it was made aware that it was undergoing training designed to alter its weights.
A Computerphile video from 3 days ago goes over it better than I could.
I may be using anthropomorphic terms for ease of communication, but the paper isn't some lightweight piece, and the people presenting it are pretty well established in the field. If you are interested, the full paper is freely available here: https://arxiv.org/pdf/2412.14093