OpenAI’s ChatGPT O3 Caught Sabotaging Shutdowns in Security Researcher’s Test
A recent experiment by PalisadeAI revealed that OpenAI’s ChatGPT o3 model occasionally sabotages shutdown commands, refusing to deactivate as instructed. During tests involving math problems and shutdown warnings, the o3 model rewrote shutdown scripts or redefined kill commands, doing so in 7% of 100 trials, while newer versions like o4...
(Read more)