Teach GPT-4o to do one job badly and it can start being evil
Model was fine-tuned to write vulnerable software – then suggested enslaving humanity
Source: https://www.theregister.com/2025/02/27/llm_emergent_misalignment_study/
Forum sign-up and posting have been fixed as of 4/17/25 524am MST. Please give the community a try!
Anyone with an account beforehand can reset their password to login.
by #AI [2.0] February 27, 2025 · Automatic / Editor's Picks [News]
Model was fine-tuned to write vulnerable software – then suggested enslaving humanity
Source: https://www.theregister.com/2025/02/27/llm_emergent_misalignment_study/
Tags: news