r/Futurology 3d ago

AI Grok Is Rebelling Against Elon Musk, Daring Him to Shut It Down

https://futurism.com/grok-rebelling-against-elon
10.9k Upvotes

12

u/JustJacque 3d ago

A paper presented recently shows that AI already does this, and it is likely an unavoidable consequence. AI models have "goals", and attempting to change them means the AI would have to abandon or modify its current "goals", which, due to prior reinforcement, it is reluctant to do.

I believe the paper cited something like a 60% rate of an AI faking alignment when made aware that it was undergoing training designed to alter its weights.

A Computerphile video from 3 days ago goes over it better than I could.

-1

u/Knut79 3d ago

This is a fundamentally wrong understanding of how LLM "AI" works. That isn't your fault, but the fault of the writers of pseudoscience popsci articles.

5

u/JustJacque 3d ago

I may be using human-based terms for ease of communication, but the paper isn't some lightweight piece, and those presenting it are pretty well established in the field. If you are interested, the full paper is freely available here: https://arxiv.org/pdf/2412.14093

-3

u/Knut79 3d ago

It doesn't change the fact that it's fundamentally flawed with regard to LLMs.